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Another, is the class switch recombination (CSR). In the recombination, effector 
functions of antibodies, such as complement fixation, are selected by exchanging exons 
encoding constant region of immunoglobulin heavy chain (Curr. Top. Microbiol. Immunol., 
217:151-169, 1996; Annu. Rev. Immunol., 8:717-735, 1990). 

These two types of genetic editing are very important for effective humoral 
immunoreaction to eliminate harmful microbes. The molecular mechanisms of the genetic 
phenomena have not yet elucidated despite the y extensive studies for several decades. 

The present inventors isolated mouse B cell clone, CH12F3-2, as a research tool to 
elucidate the molecular mechanism of class switch recombination of immunoglobulin. In the 
B cell line, class switch recombination (CSR) from IgM to IgA begins several hours after 
stimulation with IL-4, TGF-p, and CD40L, and finally, over 80% of the cells become IgA 
positive (Immunity, 9:1-10, 1998; Curr. Biol., 8:227-230, 1998; Int. Immunol., 8:193-201, 
1996). 

Using the mouse B cell clone CH12F3-2, the present inventors had reported that the 
breakpoints of CSR distribute not only in switch region (S region), characteristic repeated 
sequences, but also in the neighboring sequences (Curr. Biol., 8:227-230, 1998). However, 
the breakpoints were rarely seen in I exon and C exon, (beating at upstream and downstream 
of S region, respectively. Also, according to the accumulated scientific evidence, it has been 
shown that the transcription of I exon and C exon and the splicing of the transcripts are 
essential for CSR (Cell, 73:1 155-1 164, 1993; Science, 259:984-987, 1993; Proc. Natl. Acad. 
Sci, USA, 90:3705-3709, 1993; Cell, 81:833-836, 1995). 

This namely suggests that the transcripts are involved in CSR either directly or 
indirectly. Accordingly, the present inventors propose a theory that class switch is initiated 
by the recognition of DNA-RNA complex structure and not by the recognition of nucleotide 
sequences of switch region. This idea is further fortified by the fact that even in the case that 
Sa region is substituted with Sa region or Sy region by introducing a mini-chromosome to 
above-mentioned mouse B cell clone CH12F3-2, CSR in the mini-chromosome efficiently 
occurs by stimuli of cytokines (Immunity, 9:1-10, 1998). 

In plants and Protozoa, RNA editing, another type of genetic editing, is widely used 
as a mean for producing functional genes from limited genome (Cell, 81:833-836, 1995; Cell, 
81 :837-840, 1995). mRNA editing of many molecules such as mRNA of apolipoprotein B 



(apoB), AMPA receptors, Wilmstumor-1, a-galactosidase and neurofibromatosis type-1, and 
tRNA-Asp, have been reported (Trends Genet., 12:418-424, 1996; Curr. Opin. Genet. Dev., 
6:221-231, 1996). Although the molecular mechanism of mammalian RNA editing has not 
yet been elucidated, one performed by APOBEC-1 (apolipoprotein B mRNA editing enzyme, 
5 catalytic polypeptide- 1 ) becomes understood by degrees (Science, 260 : 1 8 1 6- 1 8 1 9, 1 993 ; J. 
Biol. Chem., 268:20709-20712, 1993). 

In apoB RNA editing, the first base C (cytosine) of codon CAA, which encodes 
glutamine, is converted to U (uridine), which alters the codon to UAA. As the result, in- 
frame stop codon is made in the apoB mRNA (J. Cell., 81:187-195, 1995; J. Cell., 50:831- 
10 840, 1987; Science, 238:363-266, 1987). apoB-48 and apoB-100 are transcripts of edited 
mRNA and unedited mRNA of apoB, respectively, and these proteins possess totally 
different physiological functions for each other (J. Biol Chem., 271:2353-2356, 1996). 
^ In the site-specific RNA-editing, auxiliary factors are required (Science, 260:1816- 

03 1819, 1993; J. Biol. Chem., 268:20709-20712, 1993). In the absence of auxiliary factors, 

m 15 APOBEC-1 shows only a cytidine deaminase activity, possessing non-specific low affinity to 

J RNA (J. Biol. Chem., 268:20709-20712, 1993; J. Cell., 81:187-195, 1995; J. Biol. Chem., 

yy 

§ 270:14768-14775, 1995; J. Biol. Chem., 270:14762-14767, 1995). The expression and 

activity of the auxiliary factors are found not only in organs with apoB mRNA editing, but 
also in organs with undetectable level of APOBEC-1 expression, or organs without apoB 

J 20 mRNA editing (Science, 260:1816-1819, 1993; J. Biol. Chem., 268:20709-20712, 1993; 

TFT?! 

g Nucleic Acids Res., 22:1874-1879, 1994; Proc Natl. Acad. Sci, USA, 91:8522-8526, 1994; J. 

M= Biol. Chem., 269:21725-21734, 1994). 

The unexpected expression of the auxiliary factors involved in apoB mRNA editing 
suggests that the auxiliary factors may be involved in more general cellular function or other 
25 yet unknown RNA editing. Since there are possibilities that CSR and hypermutation, which 
are genetic editing relating to immunoglobulin, may be accomplished by RNA editing, it is 
very interesting to elucidate whether RNA editing takes place or not in the genetic editing of 
immunoglobulin gene mentioned above. 
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SUMMARY 

The present invention provides ADD (Activation-Induced cytidine Deaminase), a 
novel cytidine deaminase having structural relationship to APOBEC-1, one of RNA editing 
enzyme, and involved in RNA editing in germinal center B cells, where genetic editing of 
5 immunoglobulin gene occurs, and a DNA encoding the enzyme. 

The present inventors intensively searched for novel genes involved in class switch 
recombination (CSR), one of major genetic editing of immunoglobulin gene. As a result, by 
preparing cDNA libraries for mouse B cell clone CH12F3-2, in which class switch 
recombination from IgM to IgA is shown to occur at an extremely high rate together with 

10 activation of cells by stimulation with cytokines, with and without stimulating with cytokine, 
and performing subtraction cloning using the libraries, the present inventors found genes 
encoding mouse- and human-derived novel proteins named AID (Activation-Induced 
cytidine Deaminase), having structural relationship to APOBEC-1, one of RNA editing 
enzymes, and having a cytidine deaminase activity similar to APOBEC-1. 

1 5 The AID protein in the present invention possesses features described below, and 

considered to be a very important RNA-modifying deaminase involved in regulating B cell 
activation, CSR of immunoglobulin gene, somatic hypermutation, and affinity maturation, 
which all are genetic editing specific to germinal center function: 

(1) ORF of cDNA encoding AID protein comprises 198 amino acids, with 24kDa 
20 calculated molecular weight (mouse: SEQ ID NO:2, and human: SEQ ID NO:8). Mouse 

AID protein shows approximately 28kDa molecular weight by SDS-PAGE. 

(2) The amino acid sequence of AID protein is 34% and 26% identical to 
APOBEC-1 (apolipoprotein B mRNA editing enzyme, catalytic polypeptide- 1) at amino acid 
sequence level, for mouse and human derived protein respectively. 

25. (3) AID protein has cytidine/deoxycytidine deaminase motif, which is the active 

center of the deaminase activity conserved in amino acid sequences of proteins belonging to 
cytosine nucleoside/nucleotide deaminase family. 

(4) Cytidine deaminase motif of AID protein is allied with RNA editing 
deaminase subgroup. 
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(5) AID protein has Leucine-rich region considered to be important in protein- 
protein interaction, similar to APOBEC-1. Four leucines in leucine-rich region of the AID 
protein are conserved in leucine-rich region of APOBEC-1 in rabbit, rat, mouse and human. 

(6) In the primary structure of AID protein, all amino acid residues reported to be 
necessary for APOBEC-1 to bind RNA (Phe66DPhe87DHis61 DGlu63 and Cys93) are 
conserved. 

(7) AID protein has pseudoactive site domain in its C terminal for forming 
homodimer, similar to APOBEC-1 and ECDDA, an E. coli derived cytidine deaminase. 
There are possibilities that AID protein forms homodimer, or associates with other auxiliary 
proteins. 

(8) AID protein shows a concentration-dependent cytidine deaminase activity. 
The activity can be inhibited dose dependently by tetrahydrouridine (THU), a specific 
inhibitor of cytidine deaminase. Also, a zinc chelator, 1,10-o-phenanthroline, inhibits the 
cytidine deaminase activity of AID protein while 1,7-o-phenanthroline, the inactive isomer, 
shows a weak inhibition. Thus, AID protein can considered to be a zinc-dependent cytidine 
deaminase as APOBEC-1. 

(9) Strong expression of AID mRNA expression is seen in lymph nodes 
(mesenteric and amygdaline). Also, weak expression in spleen is seen. 

(10) mRNA expression of AID protein is seen in a variety of lymphoid tissues 
(Peyer's patch, mesenteric lymph node, axillary lymph node, spleen, and bone marrow). 
Especially, notable expression is seen in peripheral lymphoid organs, such as lymphatic 
nodes and Peyer's patch. Contrariwise, expression in primary lymphoid organs is lower than 
the peripheral lymphoid organs. 

(11) Expression of AID mRNA is at detection limit level without cytokines (IL-4, 
CD40L, TGF-p) stimulation in mouse B cell clone CH12F3-2, in which the cytokines 
stimulate class switch from IgM to IgA in the cells, whereas the expression is induced 

3 hours after stimulation, and maximal expression is seen after 12 hours, with cytokine 
stimulation. 

(12) AID mRNA expression in mouse B cell clone CH12F3-2 can be induced 
strongly when stimulated with three cytokines, IL-4, CD40L and TGF-p, simultaneously, 
rather than stimulated with any one of them. Also, it can be considered that de novo protein 



synthesis is necessary for augmentation of AID mRNA expression, as the AID mRNA 
expression induction by cytokines in mouse B cell clone CH12F3-2 can be inhibited by 
cycloheximide, an protein synthesis inhibitor. 

(13) In the in vitro test, an augmentation of ADD mRNA expression can be seen 
5 when normal mouse spleen B cells are stimulated with LPS alone, LPS+IL-4, or 

LPS+TGF-p. 

(14) In the in vivo test, when normal mice are immunized with sheep red blood 
cells (SRBC), a significant augmentation of AID mRNA expression can be seen 5days after 
immunization, in which SRBC are known to induce clonal expansion, germinal center 

10 formation, and class switch recombination and affinity maturation of immunoglobulin gene. 

(15) The in vivo augmentation of AID mRNA expression by SRBC immunization 
is specifically seen in splenic CD 19 positive B cells. 

m (16) AID mRNA expression in lymphoid organs is specifically seen in the 

J3 germinal center, enriched with B cells activated by antigen stimulation. 

m 15 (17) Human AID gene locates at locus 12pl3, close to the locus 12pl3.1, where 

m APOBEC-1 gene locates. 

According to the characteristics described above, the AID protein of the present 
invention can be considered to have a function of regulating various biological mechanisms 
required for generation of antigen-specific immunoglobulins (specific antibodies), which 
20 eliminate non-self antigen (foreign antigen, self-reacting cells, etc.) that triggers various 
diseases. The mechanism for generation of immunoglobulin having high specificity to 
antigens includes germinal center functions such as activation of B cells, class switch 
recombination of immunoglobulin gene, somatic hypermutation, and affinity maturation. 
The AID protein of the present invention can be considered to be one of the enzymes that 
25 play an important role in the genetic editing occurring in germinal center B cells (e.g. class 
switch recombination and somatic mutation). 

The dysfunction of the AID protein of the present invention can be the cause for the 
humoral immunodeficiency since it induces failure of germinal center B cell function, such 
as antigen-specific B cell activation, class switch recombination, and somatic mutation. 
30 Reversely, the hyperfunction of AID protein may induce allergy disease or autoimmune 
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disease since it can cause inappropriate B cell activation and needless class switch 
recombination and somatic mutation. 

Therefore, regulation of the function of AID protein and the gene encoding it enables 
preventing and treating various immunodeficiencies, autoimmune diseases, and allergies, 
which result from, for example, B cell dysfunctions (e.g. IgA deficiency, IgA nephropathy, y 
globulinemia, hyper IgM syndrome, etc.) or class switch deficiency of immunoglobulin. 
Thus, the AID protein and the gene encoding the AID protein can be targets for the 
development of drugs for therapy of diseases mentioned above. 

Examples of diseases whose onset prevention, symptom remission, therapy and/or 
symptomatic treatment effect is expected by regulating the function of the ATP protein of the 
present invention or the gene encoding it include, for example, primary immunodeficiency 
syndrome with congenital disorder of immune system, mainly various immunodeficiencies 
considered to develop by B cell deficiency, decrease, or dysfunction (e.g., sex-linked 
agammaglobulinemia, sex-linked agammaglobulinemia with growth hormone deficiency, 
immunoglobulin deficiency with high IgM level, selective IgM deficiency, selective IgE 
deficiency, immunoglobulin heavy chain gene deletion, k chain deficiency, IgA deficiency, 
IgG subclass selective deficiency, CVID (common variable immunodeficiency), infantile 
transient dysgammaglobulinemia, Rosen syndrome, severe combined immunodeficiency 
(sex-linked, autosomal recessive), ADA (adenosine deaminase) deficiency, PNP (purine 
nucleoside phosphorylase) deficiency, MHC class II deficiency, reticular dysplasia, Wiskott- 
Aldrich syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal aberration, 
familial Ig hypermetabolism, hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, 
Good syndrome, osteodystrophy, transcobalamin syndrome, secretory bead syndrome, etc.), 
various diseases with antibody production deficiency that are secondary immunodeficiency 
syndrome with disorder of immune system caused by an acquired etiology (for example, 
AIDS, etc.), and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis, 
conjunctivitis, allergic rhinitis, allergic enteritis, drug-induced allergy, food allergy, allergic 
urticaria, glomerulonephritis, etc.). 

Namely, the AID protein of the present invention, a fragment thereof, a DNA 
encoding the AID protein, a fragment thereof, and an antibody against the AID protein are 
useful as reagents for developing drugs for prevention and therapy of such diseases. 



Also, the DNA itself is useful as an antisense drug regulating the function of AID 
gene at a gene level and in a use in gene therapy. The protein or the fragments thereof (e.g. 
enzyme active site) itself is useful as a drug. 

Furthermore, a DNA comprising a complementary nucleotide sequence to an 
arbitrary partial nucleotide sequence in the nucleotide sequence of genomic DNA encoding 
AID protein of the present invention (especially human AID protein) is useful as a primer 
DNA for polymerase chain reaction (PCR). 

An arbitrary partial nucleotide sequence of genomic DNA encoding the AID protein 
(especially human AID protein) of the present invention can be amplified by PCR using the 
primer DNA pair. For example, in the case that mutation or deletion of the nucleotide 
sequence of genomic DNA (especially exon) encoding AID protein is presumed to cause a 
certain immunodeficiency or an allergy, mutations and deletions in the genomic DNA can be 
identified by amplifying an arbitrary partial nucleotide sequence of genomic DNA encoding 
the AID protein obtained from tissue or cells of immunodeficiency or allergy patients by 
PCR using a pair of the primer DNA, by analyzing the presence and the size of PCR 
products, and the nucleotide sequence of the PCR products, and by comparing the nucleotide 
sequence with the corresponding nucleotide sequence in the genomic DNA encoding the AID 
protein derived from the normal human. That is to say, this method is capable of not only, 
for example, elucidating relationships between immunodeficiency or allergy and AID 
protein, but also, in the case where AID protein is the cause of onset of a sort of disease (e.g. 
immunodeficiency and/or allergy), diagnosing the diseases by the methods mentioned above. 

Furthermore, an antibody reactive to the AID protein of the present invention or a 
fragment thereof is extremely useful as an antibody drug by regulating functions of the AID 
protein. 

Furthermore, the gene (DNA), protein, and antibody of the present invention are 
useful as reagents for searching substrates (e.g. RNA, etc.) interacting (binding) with the 
protein (enzyme) of the present invention, or other auxiliary proteins associated with the 
protein of the present invention, and for developing drugs targeting the substrates and 
auxiliary proteins. 

Also, model animals can be generated by disrupting (inactivating) the AID gene base 
on the genetic information on the AID protein derived from mammals (e.g. mouse, etc.), 
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which is one embodiment of the DNA of present invention. By analyzing the physical, 
biological, pathological, and genetic features of the model animal, it is possible to elucidate 
functions of the genes and the proteins of the present invention. 

Furthermore, by introducing normal human AID gene or mutant human AID gene 
5 (e.g. mutant human AID genes derived from immunodeficiency patients), which is one 
embodiment of the present invention, into the model animal whose endogenous gene has 
been disrupted, model animals having only normal or mutant human AID genes of the 
present invention can be generated. By administering drugs (compounds, antibodies, etc.) 
targeting the introduced human AID genes to the model animals, therapeutic effects of the 
10 drugs can be evaluated. 

Furthermore, a method for identifying a substance that regulates production of the 
AID protein of the present invention or transcription of a gene encoding the AID protein into 
^ mRNA, or a substrate that inhibits the enzyme activity of the AID protein (e.g. cytidine 

%0 deaminase activity) are extremely useful as means to develop drugs for therapy and 

1 5 prevention of various diseases (especially, immunodeficiency and/or allergy) in which the 
jjjj above-mentioned AID protein or AID gene is considered to be involved. 

£0 Thus, the present invention, for the first time, provides blow-mentioned DNAs 

™ (cDNAs, genomic DNAs, and an arbitrary fragment thereof), proteins, expression vectors, 

O transformants, antibody pharmaceutical compositions, cells, the use of the DNA fragments as 

ftj 20 primer DNAs, and methods for screening. 

— (1) A DNA or a fragment thereof encoding a protein comprising the amino acid 

sequence of SEQ ID NO:2 or 8. 

(2) The DNA or the fragment of (1), wherein the protein has a cytidine deaminase 
activity. 

25 (3) A DNA or a fragment thereof comprising the nucleotide sequence of SEQ ID 

NO:l or 7. 

(4) A DNA or a fragment thereof comprising a nucleotide sequence of (a) or (b) 

below: 

(a) a nucleotide sequence comprising the nucleotide residues 93 to 689 of 
30 SEQ ID NO: lor 



as 
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(b) a nucleotide sequence comprising the nucleotide residues 80 to 676 of 

SEQIDNO:7. 

(5) A DNA or a fragment thereof of (a) or (b) below: 

(a) a DNA or a fragment thereof that hybridizes under stringent conditions 
5 with a DNA comprising the nucleotide sequence of SEQ ID NO: 1 and that encodes a 

mammal-derived protein being homologous to a protein that comprises the amino acid 
sequence of SEQ ID NO:2 and having a cytidine deaminase activity or 

(b) a DNA or a fragment thereof that hybridizes under stringent conditions 
with a DNA comprising the nucleotide sequence of SEQ ID NO:7 and that encodes a 

10 mammal-derived protein being homologous to a protein that comprises the amino acid 
sequence of SEQ ID NO: 8 and having a cytidine deaminase activity. 

(6) A protein or a fragment thereof comprising the amino acid sequence of SEQ 
ID NO:2 or 8. 

(7) A protein or a fragment thereof comprising substantially the same amino acid 
15 sequence as that of SEQ ID NO: 2 or 8 and having a cytidine deaminase activity. 

(8) A proteins of (a) or (b) below, 
(a) a mammal-derived protein that comprises an amino acid sequence 

encoded by a DNA hybridizing under stringent conditions with a DNA comprising the 
nucleotide sequence of SEQ ID NO: 1, that is homologous to a protein comprising the amino 
nj 20 acid sequence of SEQ ID NO:2, and that has a cytidine deaminase activity or 
jf? (b) a mammal-derived protein that comprises an amino acid sequence 

M* encoded by a DNA hybridizing under stringent conditions with a DNA comprising the 

nucleotide sequence of SEQ ID NO:7, that is homologous to a protein comprising the amino 
acid sequence of SEQ ID NO:8, and that has a cytidine deaminase activity. 
25 (9) An expression vector comprising the DNA or the fragment of any one of (1) 

to (5). 

(10) A transformant transformed with the expression vector of (9). 

(11) An antibodies or a portion thereof reactive to the protein of any one of (6) to 
(8) or to a fragment of the protein. 

30 (12) The antibodies or the portion of (1 1), wherein the antibody is a monoclonal 

antibody. 
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(13) A pharmaceutical composition comprising the antibody or the portion of (1 1) 
or (12), and a pharmaceutically acceptable carrier. 

(14) A cell producing a monoclonal antibody reactive to the protein of any one of 
(6) to (8) or to a fragment of the protein. 

(15) The cell of (14), wherein the cell is a hybridoma obtained by fusing, with a 
mammal-derived myeloma cell, a non-human mammal-derived B cell that produces a 
monoclonal antibody. 

(16) The cell of (15), wherein the cell is a transgenic cell transformed by 
introducing, into a cell, either or both of a DNA encoding a heavy chain of the monoclonal 
antibody and a DNA encoding a light chain of the monoclonal antibody. 

(17) A genomic DNA or a fragment thereof comprising a nucleotide sequence of 
any one of (a) to (c) below: 

(a) SEQIDNO:9, 

(b) SEQIDNO:10,or 

(c) SEQIDNO:35. 

(18) A genomic DNA or a fragment thereof comprising a nucleotide sequence of 
any one of (a) to (e) below: 

(a) SEQIDNO:ll, 

(b) SEQIDNO:12, 

(c) SEQIDNO:13, 

(d) SEQIDNO:14,or 

(e) SEQIDNO:15. 

(19) A DNA comprising a complementary nucleotide sequence to an arbitrary 
partial nucleotide sequence of a nucleotide sequence of any one of (a) to (h) below: 



(a) 


SEQ ID NO:9, 


(b) 


SEQIDNO:10, 


(c) 


SEQIDNO:ll, 


(d) 


SEQ ID NO: 12, 


(e) 


SEQ ID NO: 13, 


(0 


SEQ ID NO: 14, 


(g) 


SEQ ID NO: 15, or 
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(h) SEQ ID NO:25. 
(20) The DNA of (19), wherein the DNA comprises a nucleotide sequence of any 
one of (a) to (q) below: 



(a) 


SEQ ID NO: 18, 


(b) 


SEQ ID NO: 19, 


(c) 


SEQ ID NO:20, 


(d) 


SEQIDNO:21, 


(e) 


SEQ ID NO:22, 


(f) 


SEQ ID NO:23, 


(g) 


SEQ ID NO:24, 


(h) 


SEQ ID NO:25, 


(i) 


SEQ ID NO:26, 


(i) 


SEQ ID NO:27, 


Art 


cpO IT) NO-IS 


(i) 


SEQ ID NO:29, 


(m) 


SEQ ID NO:30, 


(n) 


SEQIDNO:31, 


(o) 


SEQ ID NO:32, 


(P) 


SEQ ID NO:33, or 


(q) 


SEQ ID NO:34. 



(21) Use of the DNA of (19) or (20) as a primer DNA in polymerase chain 
reaction. 

(22) Use of a pair of DNA of any one of (a) to (n) below as primer DNAs in 
polymerase chain reaction: 

(a) a DNA comprising the nucleotide sequence of SEQ ID NO:3 1 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:32, 

(b) a DNA comprising the nucleotide sequence of SEQ ID NO:20 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:22, 

(c) a DNA comprising the nucleotide sequence of SEQ ID NO:21 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:30, 
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(d) a DNA comprising the nucleotide sequence of SEQ ED NO:24 and a 
DNA comprising the nucleotide sequence of SEQ ED NO:25, 

(e) a DNA comprising the nucleotide sequence of SEQ ID NO:23 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:27, 

(f) a DNA comprising the nucleotide sequence of SEQ ID NO:23 and a 
DNA comprising the nucleotide sequence of SEQ ID NO;28, 

(g) a DNA comprising the nucleotide sequence of SEQ ID NO:23 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:29, 

(h) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:27, 

(i) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:28, 

(j) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:29, 

(k) a DNA comprising the nucleotide sequence of SEQ ID NO:34 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:28, 

(1) a DNA comprising the nucleotide sequence of SEQ ID NO:34 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:29, 

(m) a DNA comprising the nucleotide sequence of SEQ ID NO:33 and a 
DNA comprising the nucleotide sequence of SEQ ID NO:29, or, 

(n) a DNA comprising the nucleotide sequence of SEQ ID NO: 1 8 and a 
DNA comprising the nucleotide sequence of SEQ ID NO: 19. 

(23) A method for identifying a substance that regulates transcription of a gene 
encoding an AID protein comprising the amino acid sequence of SEQ ID NO:2 or 8 into 
mRNA, or production of the AID protein, the method comprising the steps of: 

(a) culturing, separately in the presence and the absence of the substance, 
cells producing the AID protein and 

(b) (i) comparing the level of the AID protein produced by the cells 
cultured in the presence of the substance with the level of the AID protein produced by the 
cells cultured in the absence of the substance or 
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(ii) comparing the level of the AID protein-encoding mRNA 
transcribed in the cells cultured in the presence of the substance with the level of the AID 
protein-encoding mRNA transcribed in the cells cultured in the absence or the substance. 

(24) A method for identifying a substance that regulates transcription of a gene 
encoding an AID protein comprising the amino acid sequence of SEQ ID NO:2 or 8 into 
mRNA, or production of the AID protein, the method comprising the steps of: 

(a) culturing, separately in the presence and the absence of the substance, 
cells producing the AID protein and a protein other than the AID protein, wherein 
transcription of a gene encoding the other protein into mRNA is dependent in the cells on the 
degree of a signal of transcription of the gene encoding the AID protein into mRNA and 

(b) comparing the level of the other protein produced by the cells cultured 
in the presence of the substance with the level of the other protein produced by the cells 
cultured in the absence of the substance. 

(25) The method of (23) or (24), wherein the cells are transgenic cells transformed 
with a gene encoding the protein. 

(26) The method of (24), wherein the cells are transgenic cells transformed with a 
gene encoding the protein and a gene encoding the other protein. 

(27) The method of (26), wherein the protein is a reporter protein. 

(28) The method of (27), wherein comparison of the level of the other protein is 
comparison of the level of a signal generated by the reporter protein. 

(29) The method of (27) or (28), wherein the reporter protein is luciferase. 

(30) A method for identifying a substance that inhibits an enzyme activity of an 
AID protein comprising the amino acid sequence of SEQ ID NO: 2 or 8, the method 
comprising the step of (a) or (b) below: 

(a) culturing, separately in the presence and the absence of the substance, 
mammal-derived B cells or tissues comprising the B cells, and comparing enzyme activities 
of the AID protein in the B cells separately cultured or 

(b) (i) administering the substance separately to an AID gene knockout 
mouse whose endogenous AID gene is inactivated so that transcription of the endogenous 
AID gene into mRNA is inhibited, and to a normal mouse and 
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(ii) comparing enzyme activities of the AID proteins in the B cells 
isolated from the respective mice. 

(31) The method of (30), wherein the enzyme activity is a cytidine deaminase 
activity. 

Hereafter, the present invention is explained in detail, by clarifying the terms used in 
the present invention and general methods for producing the proteins, DNAs, antibodies, and 
cells of the present invention. 

The "protein or a fragment thereof means a protein and a fragment thereof derived 
from a mammal such as human, bovine, sheep, pig, goat, rabbit, rat, hamster, guinea pig, 
mouse, and so on, preferably a protein or a fragment thereof derived from human, rabbit, rat, 
or mouse, and particularly preferably, a protein of a fragment thereof derived from human or 
mouse. 

As a particularly preferred embodiment, it means any protein or a fragment thereof 

below. 

(1) A protein or a fragment thereof comprising the amino acid sequence of SEQ 
ID NO:2 or 8. 

(2) A protein or a fragment thereof comprising substantially the same amino acid 
sequence as that of SEQ ID NO:2 or 8 and having a cytidine deaminase activity. 

(3) A mammal-derived protein that comprises an amino acid sequence encoded 
by a DNA hybridizing under stringent conditions with a DNA comprising the nucleotide 
sequence of SEQ ID NO:l, that is homologous to a protein comprising the amino acid 
sequence of SEQ ID NO:2, and that has a cytidine deaminase activity. 

(4) A mammal-derived protein that comprises an amino acid sequence encoded 
by a DNA hybridizing under stringent conditions with a DNA comprising the nucleotide 
sequence of SEQ ID NO:7, that is homologous to a protein comprising the amino acid 
sequence of SEQ ID NO:8, and that has a cytidine deaminase activity. 

Here, "having substantially the same amino acid sequence" means that a protein has 
an amino acid sequence where multiple amino acids, preferably 1 to 10 amino acids, 
particularly preferably 1 to 5 amino acids, in the amino acid sequence shown in the 
references are substituted, deleted, and/or modified, and that a protein has an amino acid 
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sequence where multiple amino acids, preferably 1 to 10 amino acids, particularly preferably 
1 to 5 amino acids, are added to the amino acid sequence shown in the references. 

The protein of the present invention includes monomer molecule, homodimer in 
which one strand binds to another strand comprising an identical amino acid sequence, 
heterodimer in which one strand binds to another strand comprising a different amino acid 
sequence, and oligomers such as trimer or tetramer. 

Also, the "fragment of a protein" means an arbitrary partial sequence (fragment) in 
the amino acid sequence that the above-mentioned AID protein of the present invention 
comprises. For example, it includes an enzyme active site required for the AID protein to 
exert an enzyme activity represented by a cytidine deaminase activity, and an interaction site 
required for the AID protein to bind or associate with substrates (e.g. mRNA, etc.) or various 
auxiliary proteins. 

Alphabetical triplet or single letter codes used to represent amino acids in the present 
specification or figures mean amino acids as follows: 

(Gly/G), glycine; (Ala/A), alanine; (Val/V), valine; (Leu/L), leucine; (Ile/I), 
isoleucine; (Ser/S), serine; (Thr/T), threonine; (Asp/D), aspartic acid; (Glu/E), glutamic acid; 
(Asn/N), asparagines; (Gln/Q) glutamine; (Lys/K), lysine; (Arg/R), arginine; (Cys/C), 
cysteine; (Met/M), methionine; (Phe/F), phenylalanine; (Tyr/Y), tyrosine; (Trp/W), 
tryptophan; (His/H), histidine; (Pro/P), proline. 

The proteins and fragments of the present invention can be produced by properly 
using, in addition to genetic engineering technique mentioned below, methods well known in 
the art, such as chemical synthesis, cell culture method, and so on, or their modified methods. 

Also, the AID protein of the present invention can be produced as a recombinant 
fusion protein with other protein (e.g. GST (Glutathione S-transferase), etc.). In this case, the 
fusion protein is advantageous in that it can be extremely easily purified by affinity 
chromatography employing adsorbent on which other protein molecule binding specifically 
to GST is immobilized. Moreover, since various antibodies reactive to GST are provided, the 
quantification of the fusion protein can be simply carried out by immunoassay (e.g. ELISA, 
etc.) using the antibodies against GST. 

The DNA of the present invention is a DNA encoding protein of the present invention 
and a fragment thereof, and it includes any nucleotide sequence encoding the protein of the 
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present invention and includes both genomic DNAs and cDNAs. Also, the DNA includes 
any DNA composed of any codons as long as the codons encode identical amino acids. 

Also, the DNA of the present invention includes a DNA encoding mammal AID 
protein, and, as a preferred embodiment, a DNA encoding mouse AID protein or human AID 
protein can be exemplified. 

Examples of specific embodiments are as follows: 

(1) A DNA encoding a protein comprising the amino acid sequence of SEQ ID NO:2 or 
8. 

(2) The DNA of (1), wherein the protein has a cytidine deaminase activity. 

(3) A DNA comprising the nucleotide sequences of SEQ ID NO: 1 or 7. 

(4) A DNA comprising the nucleotide residues 93 to 689 of SEQ ID NO: 1 . 

(5) A DNA comprising the nucleotide residues 80 to 676 of SEQ ID NO:7. 

(6) A DNA that hybridizes under stringent conditions with a DNA comprising the 
nucleotide sequence of SEQ ID NO:l and that encodes a mammal-derived protein being 
homologous to a protein that comprises the amino acid sequence of SEQ ID NO:2 and 
having a cytidine deaminase activity. 

(7) A DNA that hybridizes under stringent conditions with a DNA comprising the 
nucleotide sequence of SEQ ID NO: 7 and that encodes a mammal-derived protein being 
homologous to a protein that comprises the amino acid sequence of SEQ ID NO: 8 and 
having a cytidine deaminase activity. 

(8) A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one 
of (a) to (c) below: 

(a) SEQ ID NO:9, 

(b) SEQIDNO:10,or 

(c) SEQIDNO:35. 

(9) A genomic DNA or a fragment thereof comprising a nucleotide sequence of any one 
of (a) to (e) below: 

(a) SEQIDNO:ll, 

(b) SEQ ID NO: 12, 

(c) SEQ ID NO: 13, 

(d) SEQ ID NO: 14, or 
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(e) SEQIDN0:15. 
(10) A DNA comprising a complementary nucleotide sequence to an arbitrary partial 
sequence of a nucleotide sequence of any one of (a) to (h) below: 





(a) 


SEQ ED NO:9, 




\ U J 


SFO TD NO- 10 




(c) 


SEQDDNO:ll, 




(d) 


SEQIDNO:12, 




(e) 


SEQ ID NO: 13, 




(f) 


SEQ ID NO: 14, 


10 


(g) 


SEQ ID NO: 15, or 




(h) 


SEQIDNO:35. 



(11) A DNA comprising a nucleotide sequence of any one of (a) to (q) below: 



; 5 




(a) 


SEQ ID NO: 18, 








SEO ID NO- 19 




15 


(c) 


SEQ ID NO:20, 






(d) 


SEQIDNO:21, 






(e) 


SEQ ID NO:22, 


5 




(f) 


SEQ ID NO:23, 






(g) 


SEQ ID NO:24, 


ni 


20 


00 


SEQ ID NO:25, 


D 




0) 


SEQ ID NO:26, 






0) 


SEQ ID NO:27, 






00 


SEQ ID NO:28, 






(1) 


SEQIDNO:29, 




25 


(m) 


SEQ ID NO:30, 






(n) 


SEQIDNO:31, 






(o) 


SEQ ID NO:32, 






(P) 


SEQIDNO:33,or, 






(q) 


SEQIDNO:34. 



30 Furthermore, a DNA encoding a mutant protein or a fragment thereof obtained by 

substituting, deleting, and/or modifying multiple amino acids, preferably 1 to 10 amino acids, 



particularly preferably 1 to 5 amino acids, or by inserting multiple amino acids, preferably 
1 to 10 amino acids, particularly preferably 1 to 5 amino acids in the amino acid sequence 
constituting the above-defined AID protein of the present invention or a fragment thereof is 
included in the DNA of the present invention. 

The term "under stringent conditions" used herein means, for example, the following 
conditions. For example, in the case of carrying out hybridization using a probe with not less 
than 50 bases in 0.9% NaCl, target temperature of causing 50% dissociation (Tm) can be 
calculated from the formula below, and the hybridization temperature can be set as the 
formula below. 

Tm = 82.3°C+0.41'(G+C)%-500/n-0.61x (formamide)% 

(n means the number of bases of the probe) 
Temperature = Tm-25°C 
Also, in the case of using a probe with not less than 100 bases (G+C = 40 to 50%), 
the changes of Tm as (1) and (2) below can be used as the indicator. 

(1) Every 1% mismatch decreases Tm by approximately 1°C. 

(2) Every 1% formamide decreases Tm by 0.6 to 0.7°C. 
Thus, the temperature condition in the case of combination of complete 

complementary strands can be set as below. 

(A) 65 to 75°C (without formamide) 

(B) 35 to 45°C (with 50% formamide) 

The temperature condition in the case of combination of incomplete complementary 
strands can be set as below. 

(A) 45 to 55°C (without formamide) 

(B) 35 to 42°C (with 30% formamide) 

In the case of using probes with not more than 23 bases, temperature can be 37°C, or 
the formula below can also be used as an indicator. 

Temperature = 2°C x (number of A+T)+4°C x (number of C+G)-5°C 

The DNA of the present invention can be a DNA obtained by any method. For 
example, the DNA includes complementary DNA (cDNA) prepared from mRNA, DNA 
prepared from genomic DNA, DNA prepared by chemical synthesis, DNA obtained by PCR 
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amplification with RNA or DNA as a template, and DNA constructed by appropriately 
combining these methods. 

As used herein, an "isolated nucleic acid" is a nucleic acid, the structure of which is 
not identical to that of any naturally occurring nucleic acid or to that of any fragment of a 
naturally occurring genomic nucleic acid spanning more than three genes. The term 
therefore covers, for example, (a) a DNA which has the sequence of part of a naturally 
occurring genomic DNA molecule but is not flanked by both of the coding sequences that 
flank that part of the molecule in the genome of the organism in which it naturally occurs; (b) 
a nucleic acid incorporated into a vector or into the genomic DNA of a prokaryote or 
eukaryote in a manner such that the resulting molecule is not identical to any naturally 
occurring vector or genomic DNA; (c) a separate molecule such as a cDNA, a genomic 
fragment, a fragment produced by polymerase chain reaction (PCR), or a restriction 
fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene 
encoding a fusion protein. Specifically excluded from this definition are nucleic acids 
present in random, uncharacterized mixtures of different DNA molecules, transfected cells, 
or cell clones, e.g., as these occur in a DNA library such as a cDNA or genomic DNA library. 

The term "substantially pure" as used herein in reference to a given polypeptide 
means that the polypeptide is substantially free from other biological macromolecules. For 
example, the substantially pure polypeptide is at least 75%, 80, 85, 95, or 99% pure by dry 
weight. Purity can be measured by any appropriate standard method known in the art, for 
example, by column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. 

The invention also includes a polypeptide, or fragment thereof, that differs from the 
corresponding sequence shown as SEQ ID NO:2 or 8. The differences are, preferably, 
differences or changes at a non-essential residue or a conservative substitution. In one 
embodiment, the polypeptide includes an amino acid sequence at least about 60% identical to 
a sequence shown as SEQ ID NO:2 or 8, or a fragment thereof. Preferably, the polypeptide is 
at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more identical to SEQ ID 
NO:2 or 8 and has at least one cytidine deaminase function or activity described herein. 
Preferred polypeptide fragments of the invention are at least 10%, preferably at least 20%, 
30%, 40%, 50%, 60%, 70%, or more, of the length of the sequence shown as SEQ ID NO:2 
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or 8 and have at least one cytidine deaminase activity described herein. Or alternatively, the 

fragment can be merely an immunogenic fragment. 

As used herein, "% identity" of two amino acid sequences, or of two nucleic acid 

sequences, is determined using the algorithm of Karlin and Altschul (PNAS USA 87:2264- 
5 2268, 1990), modified as in Karlin and Altschul, PNAS USA 90:5873-5877, 1993). Such an 

algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al. (J. 

Mol. Biol. 215:403-410, 1990). BLAST nucleotide searches are performed with the 

NBLAST program, score = 100, wordlength =12. BLAST protein searches are performed 

with the XBLAST program, score =50, wordlength=3. To obtain gapped alignment for 
10 comparison purposes GappedBLAST is utilized as described in Altschul et al (Nucleic Acids 

Res. 25:3389-3402, 1997). When utilizing BLAST and GappedBLAST programs the default 

parameters of the respective programs (e.g., XBLAST and NBLAST) are used to obtain 
™ nucleotide sequences homologous to a nucleic acid molecule of the invention. 

S Accordingly, in one aspect, the invention provides an isolated or purified nucleic acid 

jp s 15 molecule that encodes a polypeptide described herein or a fragment thereof. Preferably, the 

isolated nucleic acid molecule includes a nucleotide sequence that is at least 60% identical to 
ED the nucleotide sequence shown in SEQ ID NO:l or 7. More preferably, the isolated nucleic 

'f acid molecule is at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 

B 96%, 97%, 98%, 99%, or more, identical to the nucleotide sequence shown in SEQ ID NO:l 

pj 20 or 7. In the case of an isolated nucleic acid molecule which is longer than or equivalent in 
S length to the reference sequence, e.g., SEQ ID NO:l or 7, the comparison is made with the 

M" full length of the reference sequence. Where the isolated nucleic acid molecule is shorter that 

the reference sequence, e.g., shorter than SEQ ID NO:l or 7, the comparison is made to a 

segment of the reference sequence of the same length (excluding any loop required by the 
25 homology calculation). 

The DNA encoding the protein of the present invention can be prepared by the usual 

methods: cloning cDNA from mRNA encoding the protein of the present invention, isolating 

genomic DNA and splicing it, chemical synthesis, and so on. 



30 (1) cDNA can be cloned from mRNA encoding the protein of the present invention by, for 
example, the method described below. 
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First, the mRNA encoding the protein of the present invention is prepared from the 
above-mentioned tissues or cells expressing and producing the protein of the present 
invention. mRNA can be prepared by isolating total RNA by a known method such as 
guanidine-thiocyanate method (Chirgwin et al., Biochemistry, 18:5294, 1979), hot phenol 
5 method, or AGPC method, and subjecting it to affinity chromatography using oligo-dT 
cellulose or poly-U Sepharose. 

Then, with the mRNA obtained as a template, cDNA is synthesized, for example, by 
a well-known method using reverse transcriptase, such as the method of Okayama et al (Mol. 
Cell. Biol. 2:161 (1982); Mol. Cell. Biol. 3:280 (1983)) or the method of Hoffman et al. 
10 (Gene 25:263 (1983)), and converted into double-stranded cDNA. A cDNA library is 
prepared by transforming E. coli with plasmid vectors, phage vectors, or cosmid vectors 
having this cDNA or by transfecting E. coli after in vitro packaging. 

The plasmid vectors used in this invention are not limited as long as they are 
replicated and maintained in hosts. Any phage vector that can be replicated in hosts can also 
15 be used. Examples of usually used cloning vectors are pUC19, ?igtl0, ^gtll, and so on. 

When the vector is applied to immunological screening as mentioned below, a vector having 
a promoter that can express a gene encoding the desired protein in a host is preferably used. 

cDNA can be inserted into a plasmid by, for example, the method of Maniatis et al. 
(Molecular Cloning, A Laboratory Manual, second edition, Cold Spring Harbor Laboratory, 
U 20 p. 1 .53, 1989). cDNA can be inserted into a phage vector by, for example, the method of 
5 Hyunh et al. (DNA cloning, a practical approach, 1, p.49 (1985)). These methods can be 

~ simply performed by using a commercially available cloning kit (for example, a product from 

Takara Shuzo). The recombinant plasmid or phage vector thus obtained is introduced into an 
appropriate host cell such as a prokaryote (for example, E. coli: HB101, DH5a, MC1061/P3, 
25 etc). 

Examples of a method for introducing a plasmid into a host are, calcium chloride 
method, calcium chloride/rubidium chloride method and electroporation method, described 
in Molecular Cloning, A Laboratory Manual (second edition, Cold Spring Harbor 
Laboratory, p. 1 .74 (1989)). Phage vectors can be introduced into host cells by, for example, 
30 a method in which the phage DNAs are introduced into grown hosts after in vitro packaging. 
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In vitro packaging can be easily performed with a commercially available in vitro packaging 
kit (for example, a product from Stratagene or Amersham). 

The identification of cDNA encoding protein, its expression being augmented 
depending on the stimulation of cytokines like AID protein of the present invention, can be 
carried out by for example suppression subtract hybridization (SSH)(Proc. Natl. Acad. Sci. 
USA, 93:6025-6030, 1996; Anal. Biochem., 240:90-97, 1996) taking advantage of 
suppressive PCR effect (Nucleic Acids Res., 23:1087-1088, 1995), using two cDNA 
libraries, namely, cDNA library constructed from mRNA derived from stimulated cells 
(tester cDNA library) and that constructed from mRNA derived from unstimulated cells 
(driver cDNA library). 

The preparation of cDNA libraries required for subtraction cloning can be performed 
by using commercially available kit, for example, PCR-Select Subtraction Kit (CLONTECH, 
cat: Kl 804-1). The experiment can be performed according to the document of procedure 
accompanying in the kit. 

An example of practical experimental procedure is listed below, briefly. 

PolyA + RNA is prepared from cells with or without stimulation with appropriate 
stimulant as previously reported method (Nucleic Acids Res., 26:91 1-918, 1998). Next, 
cDNA is prepared using reverse transcriptase from each polyA + RNA samples, as is the 
commonly used method. cDNA prepared from stimulated cells is used as tester cDNA and 
that prepared from unstimulated cells as driver cDNA. 

According to the previous report mentioned above and experimental manuals 
accompanying with kit, driver cDNA is added to tester cDNA to perform subtraction. The 
efficiency of subtraction is monitored by adding small amount of exogenous DNA as a 
control. After subtraction, the exogenous DNA is concentrated. 

The subtracted cDNA is cloned into appropriate plasmid expression vector to 
construct a plasmid library by commonly used method. 

Similar to the previous report, many colonies are screened by differential 
hybridization method (Nucleic Acids Res., 26:91 1-918, 1998; RINSYO-MEN-EKI, 29:451- 
459, 1997). Here, as the hybridization probes, tester cDNA and driver cDNA mentioned 
above labeled with radioisotope can be used. Clones containing the objective DNA and that 
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containing exogenous DNA can be distinguished by hybridizing the exogenous DNA with 
replicant filters. 

Objective cDNA or its fragment can be obtained by selecting clones giving strong 
signal against radiolabeled tester cDNA probe rather than radiolabeled driver cDNA probe. 

Also, cDNA encoding the protein of the present invention can be accomplished by 
other general cDNA screening method. 

For instance, cDNA or its fragment encoding the protein of the present invention 
cloned by subtraction cloning method mentioned above, or chemically synthesized 
oligonucleotides corresponding to amino acid sequence of the protein of the present 
invention, are labeled with 32 P to make probes, then by well-known colony hybridization 
method (Crunstein et al., Proc. Natl. Acid. Sci. USA, 72:3961, 1975) or plaque hybridization 
method (Molecular Cloning, A Laboratory Manual, second edition, Cold Spring Harbor 
Laboratory, p.2.108, 1989), commercial or originally prepared cDNA libraries can be 
screened. Furthermore, a method to amplify DNA including cDNA encoding the protein of 
the present invention by PCR, by constructing a pair of PCR primer based on cDNA or its 
fragment encoding the protein of the present invention isolated by the subtraction cloning 
mentioned above, can be listed. 

When a cDNA library prepared using a cDNA expression vector is used, the desired 
clone can be screened by the antigen-antibody reaction using an antibody against the desired 
protein. A screening method using PCR method is preferably used when many clones are 
subjected to screening. 

The nucleotide sequence of the DNA thus obtained can be determined by Maxam- 
Gilbert method (Maxam et al. Proc. Natl. Acad. Sci. USA, 74:560 (1977)) or the 
dideoxynucleotide synthetic chain termination method using phage M13 (Sanger et al. Proc. 
Natl. Acad. Sci. USA, 74:5463-5467 (1977)). The nucleotide sequence can be easily 
determined using a commercial DNA sequencer. 

The whole or a part of the gene encoding the protein of the present invention can be 
obtained by excising the clone obtained as mentioned above with restriction enzymes and so 
on. 
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(2) Also, the DNA encoding the protein of the present invention can be isolated from the 
genomic DNA derived from the cells expressing the protein of the present invention as 
mentioned above by the following methods. 

Such cells are solubilized preferably by SDS or proteinase K, and the DNAs are 
deproteinized by repeating phenol extraction. RNAs are digested preferably with 
ribonuclease. The DNAs obtained are partially digested with appropriate restriction 
enzymes, and the DNA fragments obtained are amplified with appropriate phage or cosmid 
to generate a library. Then, clones having the desired sequence are detected, for example, by 
using radioactively labeled DNA probes, and the whole or a portion of the gene encoding the 
protein of the present invention is obtained from the clones by excision with restriction 
enzymes, etc. 

For example, cDNA encoding a human-derived protein can be obtained by preparing 
a cosmid library into which human genomic DNAs (chromosomal DNAs) are introduced 
("Laboratory Manual Human Genome Mapping," M. Hori and Y. Nakamura, eds., 
Maruzen), screening the cosmid library to obtain positive clones containing DNA 
corresponding to the coding region of the desired protein, and screening the above cDNA 
library using the coding region DNA excised from the positive clones as a probe. 

Also, the present invention relates to any fragment of DNA (cDNA, genomic DNA, 
etc.) encoding AID protein (especially human AID protein) of the present invention 
described above. DNA with complementary nucleotide sequence to any nucleotide sequence 
of cDNA or genomic DNA is useful as a primer DNA in polymerase chain reaction (PCR). 
By PCR using a pair of the primer DNA, any partial nucleotide sequence of genomic DNA 
encoding AID protein (especially human AID protein) of the present invention can be 
amplified. 

For instance, in the case that mutation or deletion of genomic DNA (especially exon) 
encoding the AID protein is presumed to cause a certain immunodeficiency or allergy, the 
existence of such the mutation or deletion can be analyzed by PCR described below. 

(1) Prepare a pair of primers comprising complementary nucleotide sequence to 
any partial nucleotide sequence of genomic DNA encoding AID protein of the present 
invention. 
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(2) Amplify the objective partial nucleotide sequence of the genomic DNA using 
the pair of primers, using genomic DNA encoding AID protein obtained from tissue or cells 
of immunodeficiency or allergy patients as templates. 

(3) Analyze the existence of PCR products and the nucleotide sequence of the 

5 PCR products, and identify the mutation and deletion in the genomic DNA by comparing the 
nucleotide sequence and corresponding nucleotide sequence of genomic DNA encoding ATP 
protein derived from normal human. 

Thus, the method described above can not only elucidate, for example, the relation 
between immunodeficiency and/or allergy and AID protein, but also be used for the 
10 diagnosis of a certain kind of disease, in the case that AID protein is the cause of the disease. 
Examples of the nucleotide sequence of the primer DNA are as follows: 
(1) A DNA comprising a complementary nucleotide sequence to an arbitrary partial 
sequence of a nucleotide sequence of any one of (a) to (h) below: 





(a) 


SEQIDNO:9, 


i 15 


(b) 


SEQ ID NO: 10, 




(c) 


SEQIDNO:ll, 


ULi 
ft 


(d) 


SEQIDNO:12, 




(e) 


SEQIDNO:13, 


jj i 


(f) 


SEQIDNO:14, 


ra 20 

ffs 


(g) 


SEQ ID NO: 15, or 




(h) 


SEQIDNO:35. 



(2) A DNA comprising a nucleotide sequence of any one of (a) to (q) below: 





(a) 


SEQIDNO:18, 




(b) 


SEQIDNO:19, 


25 


(c) 


SEQIDNO:20, 




(d) 


SEQIDNO:21, 




(e) 


SEQ ID NO:22, 




(f) 


SEQ ID NO:23, 




(g) 


SEQ ID NO:24, 


30 


(h) 


SEQ ID NO:25, 




0) 


SEQ ID NO:26, 
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0) SEQ ID NO:27, 
(k) SEQIDNO:28, 
(1) SEQ ID NO:29, 
(m) SEQIDNO:30, 
(n) SEQIDNO:31, 

(0) SEQ ID NO:32, 
(p) SEQ ID NO:33, or, 
(q) SEQ ID NO:34. 

Also, the present invention relates to the use of the above-mentioned DNA fragment 
as a primer DNA in polymerase chain reaction. 

Examples of the combination of primer DNAs for PCR in diagnosis accomplished by 
PCR gene amplification and by analyzing it are as follows: 

(1) a DNA comprising the nucleotide sequence of SEQ ID NO:3 1 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:32, 

(2) a DNA comprising the nucleotide sequence of SEQ ID NO:20 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:22, 

(3) a DNA comprising the nucleotide sequence of SEQ ID NO:21 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:30, 

(4) a DNA comprising the nucleotide sequence of SEQ ID NO:24 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:25, 

(5) a DNA comprising the nucleotide sequence of SEQ ID NO:23 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:27, 

(6) a DNA comprising the nucleotide sequence of SEQ ID NO:23 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:28, 

(7) a DNA comprising the nucleotide sequence of SEQ ID NO:23 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:29, 

(8) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:27, 

(9) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:28, 
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(10) a DNA comprising the nucleotide sequence of SEQ ID NO:26 and a DNA 
comprising the nucleotide sequence of SEQ ID NO: 29, 

(11) a DNA comprising the nucleotide sequence of SEQ ID NO: 34 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:28, 

(12) a DNA comprising the nucleotide sequence of SEQ ID NO:34 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:29, 

(13) a DNA comprising the nucleotide sequence of SEQ ID NO:33 and a DNA 
comprising the nucleotide sequence of SEQ ID NO:29, or, 

(14) a DNA comprising the nucleotide sequence of SEQ ID NO: 18 and a DNA 
comprising the nucleotide sequence of SEQ ID NO: 19. 

Moreover, the present invention also relates to a recombinant vector comprising the 
DNA encoding the protein of the present invention. As a recombinant vector of the present 
invention, any vector can be used as long as it is capable of retaining replication or self- 
multiplication in each host cell of prokaryotic and/or eukaryotic cells, including plasmid 
vectors and phage vectors. 

The recombinant vector can easily be prepared by ligating the DNA encoding protein 
of the present invention with a vector for recombination available in the art (plasmid DNA 
and bacteriophage DNA) by the usual method. 

Specific examples of the vectors for recombination used are E. co/z-derived plasmids 
such as pBR322 3 pBR325, pUC12, pUC13, and pUC19, yeast-derived plasmids such as 
pSH19 and pSH15, and Bacillus subtilis-derived plasmids such as pUBl 10, pTP5, and 
pC194. Examples of phages are a bacteriophage such as X phage, and an animal or insect 
virus (pVL1393, Invitrogen) such as a retrovirus, vaccinia virus, and nuclear polyhedrosis 
virus. 

An expression vector is useful for expressing the DNA encoding the protein of the 
present invention and for producing the protein of the present invention. The expression 
vector is not limited as long as it expresses the gene encoding the protein of the present 
invention in various prokaryotic and/or eukaryotic host cells and produces this protein. 
Examples thereof are pMAL C2, pEF-BOS (Nucleic Acids Res. 18:5322 (1990) and so on), 
pME18S (Experimental Medicine: SUPPLEMENT, "Handbook of Genetic Engineering" 
(1992) and so on), etc. 
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Also, the protein of the present invention can be produced as a fusion protein with 
other protein. It can be prepared as a fusion protein, for example, with GST (Glutathione 
S-transferase) by subcloning a cDNA encoding the protein of the present invention, for 
example, into plasmid pGEX4Tl (Pharmacia), by transforming E. coli DH5a, and by 
culturing the transformant. 

When bacteria, particularly E. coli are used as host cells, an expression vector 
generally comprises, at least, a promoter/operator region, an initiation codon, the DNA 
encoding the protein of the present invention, termination codon, terminator region, and 
replicon. 

When yeast, animal cells, or insect cells are used as hosts, an expression vector is 
preferably comprising, at least, a promoter, an initiation codon, the DNA encoding the 
protein of the present invention, and a termination codon. It may also comprise the DNA 
encoding a signal peptide, enhancer sequence, 5'- and 3 '-untranslated region of the gene 
encoding the protein of the present invention, splicing junctions, polyadenylation site, 
selectable marker region, and replicon. The expression vector may also contain, if required, 
a gene for gene amplification (marker) that is usually used. 

A promoter/operator region to express the protein of the present invention in bacteria 
comprises a promoter, an operator, and a Shine-Dalgarno (SD) sequence (for example, 
AAGG). For example, when the host is Escherichia, it preferably comprises Trp promoter, 
lac promoter, recA promoter, XPL promoter, lpp promoter, tac promoter, or the like. 
Examples of a promoter to express the protein of the present invention in yeast are PH05 
promoter, PGK promoter, GAP promoter, ADH promoter, and so on. When the host is 
Bacillus, examples thereof are SL01 promoter, SP02 promoter, penP promoter, and so on. 
When the host is a eukaryotic cell such as a mammalian cell, examples thereof are 
SV40-derived promoter, retrovirus promoter, heat shock promoter, and so on, and preferably 
SV-40 and retrovirus-derived one. As a matter of course, the promoter is not limited to the 
above examples. In addition, using an enhancer is effective for expression. 

A preferable initiation codon is, for example, a methionine codon (ATG). 

A commonly used termination codon (for example, TAG, TAA, TGA) is exemplified 
as a termination codon. 

Usually, used natural or synthetic terminators are used as a terminator region. 
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A replicon means a DNA capable of replicating the whole DNA sequence in host 
cells, and includes a natural plasmid, an artificially modified plasmid (DNA fragment 
prepared from a natural plasmid), a synthetic plasmid, and so on. Examples of preferable 
plasmids are pBR322 or its artificial derivatives (DNA fragment obtained by treating 
pBR322 with appropriate restriction enzymes) for E. coli 9 yeast 2 ja plasmid or yeast 
chromosomal DNA for yeast, and pRSVneo ATCC 37198, pSV2dhfr ATCC 37145, pdBPV- 
MMTneo ATCC 37224, pSV2neo ATCC 37149, and such for mammalian cells. 

An enhancer sequence, polyadenylation site, and splicing junction that are usually 
used in the art, such as those derived from SV40 can also be used. 

A selectable marker usually employed can be used according to the usual method. 
Examples thereof are resistance genes for antibiotics, such as tetracycline, ampicillin, or 
kanamycin. 

Examples of genes for gene amplification are dihydrofolate reductase (DHFR) gene, 
thymidine kinase gene, neomycin resistance gene, glutamate synthase gene, adenosine 
deaminase gene, ornithine decarboxylase gene, hygromycin-B-phophotransferase gene, 
aspartate transcarbamylase gene, etc. 

The expression vector of the present invention can be prepared by continuously and 
circularly linking at least the above-mentioned promoter, initiation codon, DNA encoding the 
protein of the present invention, termination codon, and terminator region, to an appropriate 
replicon. If desired, appropriate DNA fragments (for example, linkers, restriction sites, and 
so on), can be used by the usual method such as digestion with a restriction enzyme or 
ligation using T4 DNA ligase. 

Transformants of the present invention can be prepared by introducing the expression 
vector mentioned above into host cells. 

Host cells used in the present invention are not limited as long as they are compatible 
with an expression vector mentioned above and can be transformed. Examples thereof are 
various cells such as wild-type cells or artificially established recombinant cells usually used 
in technical field of the present invention (for example, bacteria (Escherichia and Bacillus), 
yeast (Saccharomyces, Pichia, and such), animal cells, or insect cells). 

E. coli or animal cells are preferably used. Specific examples are E. coli (DH5a, 
TBI, HB 101, and such), mouse-derived cells (COP, L, CI 27, Sp2/0, NS-1, NIH 3T3, and 
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such), rat-derived cells (PC 12, PC12h), hamster-derived cells (BHK, CHO, and such), 
monkey-derived cells (COS1, COS3, COS7, CV1, Velo, and such), and human-derived cells 
(Hela, diploid fibroblast-derived cells, myeloma cells, and HepG2, and such). 

An expression vector can be introduced (transformed (transfected)) into host cells by 
known methods. 

Transformation can be performed, for example, according to the method of Cohen et 
al. (Proc. Natl. Acad. Sci. USA, 69:2110 (1972)), protoplast method (Mol. Gen. Genet., 
168:1 1 1 (1979)), or competent method (I Mol. Biol, 56:209 (1971)) when the hosts are 
bacteria (E. coli, Bacillus subtilis, and such), the method of Hinnen et al. (Proc. Natl. Acad. 
Sci. USA, 75:1927 (1978)), or lithium method (J. Bacterid., 153:163 (1983)) when the host 
is Saccharomyces cerevisiae, the method of Graham (Virology, 52:456 (1973)) when the 
hosts are animal cells, and the method of Summers et al. (Mol. Cell. Biol., 3:2156-2165 
(1983)) when the hosts are insect cells. 

The protein of the present invention can be produced by cultivating transformants (in 
the following, this term includes transfectants) comprising an expression vector prepared as 
mentioned above in nutrient media. 

The nutrient media preferably comprise carbon source, inorganic nitrogen source, or 
organic nitrogen source necessary for the growth of host cells (transformants). Examples of 
the carbon source are glucose, dextran, soluble starch, and sucrose, and examples of the 
inorganic or organic nitrogen source are ammonium salts, nitrates, amino acids, corn steep 
liquor, peptone, casein, meet extract, soy bean cake, and potato extract. If desired, they may 
comprise other nutrients (for example, an inorganic salt (for example, calcium chloride, 
sodium dihydrogenphosphate, and magnesium chloride), vitamins, antibiotics (for example, 
tetracycline, neomycin, ampicillin, kanamycin, and so on). 

Cultivation is performed by a method known in the art. Cultivation conditions such 
as temperature, pH of the media, and cultivation time are selected appropriately so that the 
protein of the present invention is produced in large quantities. 

Specific media and cultivation conditions used depending on host cells are illustrated 
below, but are not limited thereto. 
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When the hosts are bacteria, actinomycetes, yeasts, filamentous fUngi, liquid media 
comprising the nutrient source mentioned above are appropriate. The media with pH 5 to 8 
are preferably used. 

When the host is E. coli, examples of preferable media are LB media, M9 media 
(Miller et al. Exp. MoL Genet, Cold Spring Harbor Laboratory, p.431 (1972)), and so on. 
Using these media, cultivation can be performed usually at 14 to 43°C for about 3 to 
24 hours with aeration and stirring, if necessary. 

When the host is Bacillus, cultivation can be performed usually at 30 to 40°C for 
about 16 to 96 hours with aeration and stirring, if necessary. 

When the host is yeast, an example of media is Burkholder minimal media (Bostian, 
Proc. Natl. Acad. Sci. USA, 77:4505 (1980)). The pH of the media is preferably 5 to 8. 
Cultivation can be performed usually at 20 to 35°C for about 14 to 144 hours with aeration 
and stirring, if necessary. 

When the host is an animal cell, examples of media are MEM media containing about 
5 to 20% fetal bovine serum (Science, 122:501 (1952)), DMEM media (Virology, 8:396 
(1959)), RPMI1640 media (J. Am. Med. Assoc., 199:519 (1967)), 199 media (Proc. Soc. 
Exp. Biol. Med., 73:1 (1950)), and so on. The pH of the media is preferably about 6 to 8. 
Cultivation can be performed usually at about 30 to 40°C for about 15 to 72 hours with 
aeration and stirring, if necessary. 

When the host is an insect cell, an example of media is Grace's media containing fetal 
bovine serum (Proc. Natl. Acad. Sci. USA, 82:8404 (1985)). The pH thereof is preferably 
about 5 to 8. Cultivation can be performed usually at about 20 to 40°C for 15 to 100 hours 
with aeration and stirring, if necessary. 

The protein of the present invention can be produced by cultivating transformants, 
especially mammalian cells, as mentioned above and allowing them to secrete the protein 
into the culture supernatant. 

A culture filtrate (supernatant) is obtained by a method such as filtration or 
centrifiigation of the obtained culture, and the protein of the present invention is purified and 
isolated from the culture filtrate by methods commonly used in order to purify and isolate a 
natural or synthetic protein. 
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Examples of the isolation and purification method are a method utilizing solubility, 
such as salting out and solvent precipitation method; a method utilizing the difference in 
molecular weight, such as dialysis, ultrafiltration, gel filtration, and sodium dodecyl sulfate- 
polyacrylamide gel electrophoresis; a method utilizing charges, such as ion exchange 
5 chromatography and hydroxylapatite chromatography; a method utilizing specific affinity, 
such as affinity column chromatography; a method utilizing the difference in hydrophobicity, 
such as reverse phase high performance liquid chromatography; and a method utilizing the 
difference in isoelectric point, such as isoelectric focusing. 

When the protein of the present invention exists in the periplasm or cytoplasm of 

10 cultured transformants (for example, E. coli), first, the cells are harvested by a usual method 
such as filtration or centrifugation and suspended in appropriate buffer. After the cell wall 
and/or cell membrane of the cells and such are disrupted by the method such as lysis with 
sonication, lysozyme, and freeze-thawing, the membrane fraction comprising the protein of 
the present invention is obtained by the method such as centrifugation or filtration. The 

15 membrane fraction is solubilized with a detergent such as Triton-XlOO to obtain the crude 
extract. Finally, the protein is isolated and purified from the crude extract by the usual 
method as illustrated above. 

By using a DNA (cDNA or genomic DNA) encoding a human-derived AID protein 
included in the protein of the present invention, transgenic non-human mammals secreting 

20 the human AID protein in their body can be prepared. Namely, by integrating the human- 
derived DNA into an endogenous locus of non-human mammals (e.g. mouse), the human 
AID protein of the present invention encoded by the DNA is expressed and secreted in their 
body. The transgenic non-human mammals are included in the present invention. 

The transgenic non-human mammals can be prepared according to the method 

25 usually used for producing a transgenic animal (for example, see "Newest Manual of Animal 
Cell Experiment", LIC press, Chapter 7, pp.361-408, (1990)). 

Specifically, for example, a transgenic mouse can be produced as follows. 
Embryonic stem cells (ES cells) obtained from normal mouse blastocysts are transformed 
with an expression vector in which the gene encoding the human AID protein of the present 

30 invention and a marker gene (for example, neomycin resistance gene) have been inserted in 
an expressible manner. ES cells in which the gene encoding the human AID protein of the 



present invention has been integrated into the endogenous gene are screened by a usual 
method based on expression of the marker gene. Then, the ES cells screened are 
microinjected into a fertilized egg (blastocyst) obtained from another normal mouse (Proc. 
Natl. Acad. Sci. USA, 77:7380-7384 (1980); U.S. Pat. No. 4,873,191). 

The blastocyst is transplanted into the uterus of another normal mouse as the foster 
mother. Then, founder mice are born from the foster mother. By mating the founder mice 
with normal mice, heterozygous transgenic mice are obtained. By mating the heterozygous 
transgenic mice with each other, homozygous transgenic mice are obtained according to 
Mendel's laws. 

Also, so-called "knockout mouse" can be generated based on the nucleotide sequence 
of DNA encoding mouse AID protein included in the present invention. The "knockout 
mouse" in the present invention means the mouse in which the endogenous gene encoding 
the mouse AID protein of the present invention is knocked-out (inactivated). For example, it 
can be generated by positive-negative selection method applying homologous recombination 
(U.S. Pat. No. 5,464,764; 5,487,992; 5,627,059; Proc. Natl. Acad. Sci. USA, 86:8932-8935, 
1989; Nature, 342:435-438, 1989; etc.), and such knockout mice are one embodiment of the 
present invention. 

The "antibody" in the present invention means a polyclonal antibody (antiserum) or a 
monoclonal antibody, and preferably a monoclonal antibody. 

Specifically, it includes an antibody reactive to the above-mentioned protein of the 
present invention and a fragment thereof. 

The "antibody" of the present invention also includes a natural antibody that can be 
prepared by immunizing mammals such as mice, rats, hamsters, guinea pigs, or rabbits with 
the protein of the present invention (including natural, recombinant, and chemically 
synthesized protein and cell), a fragment thereof, or a transformant highly expressing the 
protein of interest by recombinant technology mentioned above; a chimeric antibody and a 
humanized antibody (CDR-grafted antibody) that can be produced by recombinant 
technology; and a human monoclonal antibody that can be produced by using human 
antibody-producing transgenic animals. 

The monoclonal antibody includes those having any one of the isotypes of IgG, IgM, 
IgA, IgD, or IgE. IgG or IgM is preferable. 
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The polyclonal antibody (antiserum) or monoclonal antibody of the present invention 
can be produced by known methods. Namely, mammals, preferably, mice, rats, hamsters, 
guinea pigs, rabbits, cats, dogs, pigs, goats, horses, or bovine, or more preferably, mice, rats, 
hamsters, guinea pigs, or rabbits are immunized, for example, with an antigen mentioned 
above with Freund's adjuvant, if necessary. The polyclonal antibody can be obtained from 
the serum obtained from the animal so immunized. The monoclonal antibodies are produced 
as follows. Hybridomas are produced by fusing the antibody-producing cells obtained from 
the animal so immunized and myeloma cells incapable of producing autoantibodies. Then 
the hybridomas are cloned, and clones producing the monoclonal antibodies showing the 
specific affinity to the antigen used for immunizing the mammal are screened. 

Specifically, the monoclonal antibody can be produced as follows. Immunizations 
are done by injecting or implanting once or several times the above-mentioned protein of the 
present invention, a fragment thereof, the cells that express the protein, and so on as an 
immunogen, if necessary, with Freund's adjuvant, subcutaneously, intramuscularly, 
intravenously, through the footpad, or intraperitoneally into mice, rats, hamsters, guinea pigs, 
or rabbits, preferably mice, rats or hamsters (including transgenic animals generated so as to 
produce antibodies derived from another animal such as the transgenic mouse producing 
human antibody ). Usually, immunizations are performed once to four times every one to 
fourteen days after the first immunization. Antibody-producing cells are obtained from the 
mammal so immunized in about one to five days after the last immunization. 

Hybridomas that secrete a monoclonal antibody can be prepared by the method of 
Kohler and Milstein (Nature, 256:495-497 (1975)) and by its modified method. Namely, 
hybridomas are prepared by fusing antibody-producing cells contained in a spleen, lymph 
node, bone marrow, or tonsil obtained from the non-human mammal immunized as 
mentioned above, preferably a spleen, with myeloma cells without autoantibody-producing 
ability, which are derived from, preferably, a mammal such as mice, rats, guinea pigs, 
hamsters, rabbits, or humans, or more preferably, mice, rats, or humans. 

For example, mouse-derived myeloma P3/X63-AG8.653 (653; ATCC No. CRL1580), 
P3/NSI/l-Ag4-l (NS-1), P3/X63-Ag8.Ul (P3U1), SP2/0-Agl4 (Sp2/0, Sp2), PAI, FO, or 
BW5147; rat-derived myeloma 210RCY3-Ag.2.3.; or human-derived myeloma U-266AR1, 
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GM1500-6TG-A1-2, UC729-6, CEM-AGR, D1R11, or CEM-T15 can be used as a myeloma 
used for the cell fusion. 

Hybridoma clones producing monoclonal antibodies can be screened by cultivating 
the hybridomas, for example, in microtiter plates and by measuring the reactivity of the 
5 culture supernatant in the well in which hybridoma growth is observed, to the immunogen 
used for the immunization mentioned above, for example, by an enzyme immunoassay such 
as RIA and ELISA. 

The monoclonal antibodies can be produced from hybridomas by cultivating the 
hybridomas in vitro or in vivo such as in the ascites of mice, rats, guinea pigs, hamsters, or 
10 rabbits, preferably mice or rats, more preferably mice and isolating the antibodies from the 
resulting the culture supernatant or ascites fluid of a mammal. 

In vitro cultivation can be performed depending on the property of cells to be 
cultured, on the object of a test study, and on various culture, by using known nutrient media 
or any nutrient media derived from known basal media for growing, maintaining, and storing 
1 5 the hybridomas to produce monoclonal antibodies in the culture supernatant. 

Examples of basal media are low calcium concentration media such as Ham'F12 
medium, MCDB153 medium, or low calcium concentration MEM medium, and high calcium 
concentration media such as MCDB104 medium, MEM medium, D-MEM medium, 
% RPMI1640 medium, ASF 104 medium, or RD medium. The basal media can contain, for 

U 20 example, sera, hormones, cytokines, and/or various inorganic or organic substances 
3 depending on the objective. 

588 Monoclonal antibodies can be isolated and purified from the culture supernatant or 

ascites mentioned above by saturated ammonium sulfate precipitation, euglobulin 
precipitation method, caproic acid method, caprylic acid method, ion exchange 
25 chromatography (DEAE or DE52), affinity chromatography using anti-immunoglobulin 
column or protein A column. 

Furthermore, monoclonal antibodies can be obtained in a large quantity by cloning a 
gene encoding a monoclonal antibody from the hybridoma, generating transgenic bovines, 
goats, sheep, or pigs in which the gene encoding the antibody is integrated in its endogenous 
30 gene using transgenic animal generating technique, and recovering the monoclonal antibody 
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derived from the antibody gene from milk of the transgenic animals (Nikkei Science, No.4, 
pp.78-84(1997)). 

The "chimeric antibody" of the present invention means a monoclonal antibody 
prepared by genetic engineering, and specifically, a chimeric monoclonal antibody, for 
example, mouse/human chimeric antibody, whose variable region is a mouse 
immunoglobulin-derived variable region and whose constant region is a human 
immunoglobulin-derived constant region. 

The constant region derived from human immunoglobulin has the amino acid 
sequence inherent in each isotype such as IgG, IgM, IgA, IgD, IgE, etc. The constant region 
of the recombinant chimeric monoclonal antibody of the present invention can be that of 
human immunoglobulin belonging to any isotype. Preferably, it is the constant region of 
human IgG. 

The chimeric monoclonal antibody of the present invention can be produced, for 
example, as follows. Needless to say, the production method is not limited thereto. 

For example, mouse/human chimeric monoclonal antibody can be prepared, by 
referring to Experimental Medicine: SUPPLEMENT, 1.6, No.10 (1988); and Examined 
Published Japanese Patent Application (JP-B) No. Hei 3-73280. Namely, it can be prepared 
by ligating CH gene (C gene encoding the constant region of H chain) obtained from the 
DNA encoding human immunoglobulin to the downstream of active VH genes (rearranged 
VDJ gene encoding the variable region of H chain) obtained from the DNA encoding mouse 
monoclonal antibody isolated from the hybridoma producing the mouse monoclonal 
antibody, and by ligating the C L gene (C gene encoding the constant region of L chain) 
obtained from the DNA encoding human immunoglobulin to the downstream of active Vl 
genes (rearranged VJ gene encoding the variable region of L chain) obtained from the DNA 
encoding the mouse monoclonal antibody isolated from the hybridoma, and operably 
inserting those into the same or different vectors in an expressible manner, followed by 
transformation of host cells with the expression vector, and cultivation of the transformants. 

Specifically, DNAs are first extracted from mouse monoclonal antibody-producing 
hybridoma by the usual method, digested with appropriate restriction enzymes (for example, 
EcoRI and Hindlll), electrophoresed (using, for example, 0.7% agarose gel), and analyzed by 
Southern blotting. After the electrophoresed gel is stained, for example, with ethidium 
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bromide and photographed, the gel is given marker positions, washed twice with water, and 
soaked in 0.25 M HC1 for 15 minutes. Then, the gel is soaked in 0.4 N NaOH solution for 
10 minutes with gentle stirring. The DNAs are transferred to a filter for 4 hours following 
the usual method. The filter is recovered and washed twice with 2 x SSC. After the filter is 
sufficiently dried, it is baked at 75°C for 3 hours, treated with 0.1 x SSC/0.1% SDS at 65°C 
for 30 minutes, and then soaked in 3 x SSC/0.1% SDS. The filter obtained is treated with 
prehybridization solution in a plastic bag at 65°C for 3 to 4 hours. 

Next, 32 P-labeled probe DNA and hybridization solution are added to the bag and 
reacted at 65°C about 12 hours. After hybridization, the filter is washed under an appropriate 
salt concentration, reaction temperature, and time (for example, 2 x SSC/0.1% SDS, room 
temperature, 10 minutes). The filter is put into a plastic bag with a little 2 x SSC, and 
subjected to autoradiography after the bag is sealed. 

Rearranged VDJ gene and VJ gene encoding H chain and L chain of mouse 
monoclonal antibody respectively are identified by Southern blotting mentioned above. The 
region comprising the identified DNA fragment is fractionated by sucrose density gradient 
centrifiigation and inserted into a phage vector (for example, Charon 4A, Charon 28, 
XEMBL3, XEMBL4, etc.). E. coli (for example, LE392, NM539, etc.) are transformed with 
the phage vector to generate a genomic library. The genomic library is screened by plaque 
hybridization such as the Benton-Davis method (Science, 196:180-182 (1977)) using 
appropriate probes (H chain J gene, L chain (k) J gene, etc.) to obtain positive clones 
comprising rearranged VDJ gene or VJ gene respectively. By making the restriction map and 
determining the nucleotide sequence of the clones obtained, it is confirmed that genes 
comprising the desired, rearranged Vh (VDJ) gene or Vl (VJ) gene have been obtained. 

Separately, human CH gene and human CL gene used for chimerization are isolated. 
For example, when a chimeric antibody with human IgGl is produced, Cyi gene is isolated 
as a Ch gene, and Ck gene is also isolated as a Cl gene, are isolated. These genes can be 
isolated from human genomic library with mouse Cyi gene and mouse Ck gene, 
corresponding to human Cyi gene and human Ck gene, respectively, as probes, taking 
advantage of the high homology between the nucleotide sequences of mouse 
immunoglobulin gene and that of human immunoglobulin gene. 
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Specifically, DNA fragments comprising human Ck gene and an enhancer region are 
isolated from human X Charon 4 A Haelll-Alul genomic library (Cell, 15:1 157-1 174 (1978)), 
for example, using a 3 kb Hindlll-BamHI fragment from clone Igl46 (Proc. Natl. Acad. Sci. 
USA, 75:4709-4713 (1978)) and a 6.8 kb EcoRI fragment from clone MEP10 (Proc. Natl. 
Acad. Sci. USA, 78:474-478 (1981)) as probes. In addition, for example, after human fetal 
hepatocyte DNA is digested with Hindlll and fractioned by agarose gel electrophoresis, a 
5.9 kb fragment is inserted into A788 and then human Cyi gene is isolated with the probes 
mentioned above. 

Using mouse Vh gene, mouse Vl gene, human Ch gene, and human Cl gene so 
obtained, and taking promoter region and enhancer region into consideration, human Ch gene 
is inserted downstream of mouse Vh gene and human Cl gene is inserted downstream of 
mouse Vl gene in an expression vector such as pSV2gpt or pSV2neo with appropriate 
restriction enzymes and DNA ligase following the usual method. In this case, chimeric genes 
of mouse Vh gene/human Ch gene and mouse Vl gene/human Cl gene can be respectively 
inserted into a same or different expression vector. 

Chimeric gene-inserted expression vector(s) thus prepared are introduced into 
myeloma cells (e.g., P3X63 Ag8 653 cells or SP210 cells) that do not produce antibodies by 
the protoplast fusion method, DEAE-dextran method, calcium phosphate method, or 
electroporation method. The transformants are screened by cultivating in a medium 
containing a drug corresponding to the drug resistance gene inserted into the expression 
vector and, then, cells producing desired chimeric monoclonal antibodies are obtained. 

Desired chimeric monoclonal antibodies are obtained from the culture supernatant of 
antibody-producing cells thus screened. 

The "humanized antibody (CDR-grafted antibody)" of the present invention is a 
monoclonal antibody prepared by genetic engineering and specifically means a humanized 
monoclonal antibody wherein a portion or the whole of the complementarity determining 
regions of the hyper-variable region are derived from those of the hyper-variable region from 
mouse monoclonal antibody, the framework regions of the variable region are derived from 
those of the variable region from human immunoglobulin, and the constant region is derived 
from that from human-immunoglobulin. 
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The complementarity determining regions of the hyper-variable region exists in the 
hyper-variable region in the variable region of an antibody and means three regions which 
directly binds, in a complementary manner, to an antigen (complementarity-determining 
residues, CDR1, CDR2, and CDR3). The framework regions of the variable region mean 
four comparatively conserved regions intervening upstream, downstream or between the 
three complementarity-determining regions (framework region, FR1, FR2, FR3, and FR4). 

In other words, a humanized monoclonal antibody means that in which the whole 
region except a portion or the whole region of the complementarity determining regions of 
the hyper-variable region of a mouse monoclonal antibody has been replaced with their 
corresponding regions derived from human immunoglobulin. 

The constant region derived from human immunoglobulin has the amino acid 
sequence inherent in each isotype such as IgG, IgM, IgA, IgD, and IgE. The constant region 
of a humanized monoclonal antibody in the present invention can be that from human 
immunoglobulin belonging to any isotype. Preferably, it is the constant region of human 
IgG. The framework regions of the constant region derived from human immunoglobulin are 
not particularly limited. 

The humanized monoclonal antibody of the present invention can be produced, for 
example, as follows. Needless to say, the production method is not limited thereto. 

For example, a recombinant humanized monoclonal antibody derived from mouse 
monoclonal antibody can be prepared by genetic engineering, referring to Published Japanese 
Translations of PCT International Publication No. Hei 4-506458 and Unexamined Published 
Japanese Patent Application (JP-A) No. Sho 62-296890. Namely, at least one mouse H chain 
CDR gene and at least one mouse L chain CDR gene corresponding to the mouse H chain 
CDR gene are isolated from hybridomas producing mouse monoclonal antibody, and human 
H chain gene encoding the whole region except human H chain CDR corresponding to 
mouse H chain CDR mentioned above and human L chain gene encoding the whole region 
except human L chain CDR corresponding to mouse L chain CDR mentioned above are 
isolated from human immunoglobulin genes. 

The mouse H chain CDR gene(s) and the human H chain gene(s) so isolated are 
inserted, in an expressible manner, into an appropriate vector so that they can be expressed. 
Similarly, the mouse L chain CDR gene(s) and the human L chain gene(s) are inserted, in an 
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expressible manner, into another appropriate vector so that they can be expressed. 
Alternatively, the mouse H chain CDR gene(s)/human H chain gene(s) and mouse L chain 
CDR gene(s)/human L chain gene(s) can be inserted, in an expressible manner, into the same 
expression vector so that they can be expressed. Host cells are transformed with the 
expression vector thus prepared to obtain transformants producing humanized monoclonal 
antibody. By cultivating the transformants, desired humanized monoclonal antibody is 
obtained from culture supernatant. 

The "human antibody" used in the present invention is immunoglobulin in which the 
entire regions comprising the variable and constant region of H chain, and the variable and 
constant region of L chain constituting immunoglobulin are derived from the genes encoding 
human immunoglobulin. 

The human antibody can be produced in the same way as the production method of 
polyclonal or monoclonal antibodies mentioned above by immunizing, with an antigen, a 
transgenic animal which for example, at least human immunoglobulin gene(s) have been 
integrated into the locus of a non-human mammal such as a mouse by the usual method. 

For example, a transgenic mouse producing human antibodies is prepared by the 
methods described in already published literatures (Nature Genetics, 7:13-21 (1994); Nature 
Genetics, 15:146-156 (1997); JP-WA Hei 4-504365; W094/25585; Nikkei Science, No.6, 
pp.40-50 (1995); W094/25585; Nature, 368:856-859 (1994); JP-WA No. Hei 6-500233). 

The "portion of an antibody" used in the present invention means a partial region of 
the antibody, and preferably the monoclonal antibody of the present invention as mentioned 
above, and specifically, means F(ab') 2 , Fab', Fab, Fv (variable fragment of antibody), sFv, 
dsFv (disulfide stabilized Fv), or dAb (single domain antibody) (Exp. Opin. Ther. Patents, 6, 
No.5,pp.441-456(1996)). 

"F(ab') 2 " and "Fab"' can be produced by treating immunoglobulin (monoclonal 
antibody) with a protease such as pepsin and papain, and means an antibody fragment 
generated by digesting immunoglobulin near the disulfide bonds existing between the hinge 
regions in each of the two H chains. For example, papain cleaves IgG upstream of the 
disulfide bonds existing between the hinge regions in each of the two H chains to generate 
two homologous antibody fragments in which an L chain composed of Vl (L chain variable 
region) and C L (L chain constant region), and an H chain fragment composed of V H (H chain 
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variable region) and ChyI (yl region in the constant region of H chain) are connected at their 
C terminal regions through a disulfide bond. Each of these two homologous antibody 
fragments is called Fab'. Pepsin also cleaves IgG downstream of the disulfide bonds existing 
between the hinge regions in each of the two H chains to generate an antibody fragment 
slightly larger than the fragment in which the two above-mentioned Fab' are connected at the 
hinge region. This antibody fragment is called F(ab')2- 

The "cell producing a monoclonal antibody reactive to a protein or a fragment 
thereof of the present invention means any cell producing the above-described monoclonal 
antibody of the present invention. 

More specifically, the following is included: 

(1) B cells that are obtained by immunizing the non-human mammals with the above- 
mentioned protein of the present invention, a fragment thereof, or the cells producing the 
protein and that produce a monoclonal antibody reactive to the protein of the present 
invention or a fragment thereof. 

(2) The above-mentioned hybridomas (fused cell) prepared by fusing the thus-obtained B 
cells producing the antibody with myeloma cells derived from mammals. 

(3) Monoclonal antibody-producing transformants obtained by transforming other cells 
than the monoclonal antibody-producing B cells and hybridomas with genes encoding the 
monoclonal antibody isolated from the monoclonal antibody-producing B cells or 
hybridomas (either the heavy chain-encoding gene or the light chain-encoding gene, or both). 

The monoclonal antibody-producing transformants of (3) mean recombinant cells 
producing a recombinant monoclonal antibody produced by B cells of (1) or hybridomas of 
(2). These antibody producing-transformants can be produced by the method as used for 
producing the above-described chimeric monoclonal antibody and humanized monoclonal 
antibody. 

The "pharmaceutical composition" used herein means a pharmaceutical composition 
comprising of any of the protein, fragment thereof, antibody, or portion thereof defined 
hereinabove, and a pharmaceutically acceptable carrier. 

The "pharmaceutically acceptable carrier" includes an excipient, a diluent, an 
expander, a disintegrating agent, a stabilizer, a preservative, a buffer, an emulsifier, an 
aromatic, a colorant, a sweetener, a viscosity-increasing agent, a flavor, a dissolving agent, or 
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other additives. Using one or more of such carriers, a pharmaceutical composition can be 
formulated into tablets, pills, powders, granules, injections, solutions, capsules, troches, 
elixirs, suspensions, emulsions, or syrups. The pharmaceutical composition can be 
administered orally or parenterally. Other forms for parenteral administration include a 
5 solution for external application, suppository for rectal administration, and pessary, 
prescribed by the usual method, which comprises one or more active ingredient. 

The dosage can vary depending on the age, sex, weight, and symptoms of a patient, 
effect of treatment, administration route, period of treatment, or the kind of active ingredient 
(protein or antibody mentioned above) contained in the pharmaceutical composition. 
10 Usually, the pharmaceutical composition can be administered to an adult in a dose of 10 jig 
to 1000 mg (or 10 jig to 500 mg) per one administration. Depending on various conditions, 
the lower dosage may be sufficient in some cases, and a higher dosage may be necessary in 
^ other cases. 

C- In particular, the injection can be produced by dissolving or suspending the antibody 

S 15 in a non-toxic, pharmaceutical^ acceptable carrier such as physiological saline or 

s fJ commercially available distilled water for injections by adjusting the concentration to 0. 1 jig 

3Q antibody/ml carrier to 10 mg antibody/ml carrier. The injection thus produced can be 

^ administered to a human patient in need of treatment in a dose of 1 jag to 100 mg/kg body 

y weight, preferably 50 \ig to 50 mg/kg body weight, once or more times a day. Examples of 

yy 

fy 20 administration routes are medically appropriate administration routes such as intravenous 
injection, subcutaneous injection, intradermal injection, intramuscular injection, or 
intraperitoneal injection, preferably intravenous injection. 

The injection can also be prepared into a non-aqueous diluent (for example, 
propylene glycol, polyethylene glycol, vegetable oil such as olive oil, and alcohols such as 
25 ethanol), suspension, or emulsion. 

The injection can be sterilized by filtration with a bacteria-non-penetrable filter, by 
mixing bacteriocide, or by irradiation. The injection can be prepared at the time of use. 
Namely, it is freeze-dried to make a sterile solid composition, and can be dissolved in sterile 
distilled water for injection or another solvent before use. 
30 The pharmaceutical composition of the present invention is useful as a drug for 

preventing and treating, for example, primary immunodeficiency syndrome with congenital 
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disorder of immune system, mainly immunodeficiency considered to develop by B 
lymphocyte deficiency, decrease, or dysfunction (e.g., sex-linked agammaglobulinemia, sex- 
linked agammaglobulinemia with growth hormone deficiency, immunoglobulin deficiency 
with high IgM level, selective IgM deficiency, selective IgE deficiency, immunoglobulin 
heavy chain gene deletion, k chain deficiency, IgA deficiency, IgG subclass selective 
deficiency, CVID (common variable immunodeficiency), infantile transient 
dysgammaglobulinemia, Rosen syndrome, severe combined immunodeficiency (sex-linked, 
autosomal recessive), ADA (adenosine deaminase) deficiency, PNP (purine nucleoside 
phosphorylase) deficiency, MHC class II deficiency, reticular dysplasia, Wiskott-Aldrich 
syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal aberration, familial Ig 
hypermetabolism, hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, Good 
syndrome, osteodystrophy, transcobalamin syndrome, secretory bead syndrome, etc.), various 
diseases with antibody production deficiency that are secondary immunodeficiency 
syndrome with disorder of immune system caused by an acquired etiology (for example, 
AIDS, etc.), and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis, 
conjunctivitis, allergic rhinitis, allergic enteritis, drug-induced allergy, food allergy, allergic 
urticaria, glomerulonephritis, etc.), and for reliving condition due to various 
immunodeficiency with the diseases. 

The DNA of the present invention described above, namely, "DNA comprising any 
partial nucleotide sequence of SEQ ID NO:7, from SEQ ID NO:9 to SEQ ID NO: 15, SEQ ID 
NO:35, those with partial chemical modification, DNA comprising complementary 
nucleotide sequences to the partial sequence, or those with partial chemical modification" are 
included. 

Here, the "partial nucleotide sequence" means the partial nucleotide sequence 
comprising any number of bases at any region included in any nucleotide sequence listed in 
SEQ ID NO:7, from SEQ ID NO:9 to SEQ ID NO: 15, or SEQ ID NO:35. 

The DNA is useful as probes in DNA hybridization or RNA hybridization 
procedures. In the purpose of using the DNA as a probe, continuous nucleotide sequence of 
over 20 bases, preferably continuous nucleotide sequence of over 50 bases, more preferably 
over 100 bases, much more preferably over 200 bases, especially preferably over 300 bases, 
can be listed as the partial nucleotide sequences. 
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Also, the DNA described above as mentioned before, are useful as primers for PGR. 
In the purpose of using the DNA as PCR primers, continuous partial nucleotide sequence of 
from 5 to 100 bases, preferably from 5 to 70 bases, more preferably from 5 to 50 bases, much 
more preferably from 5 to 30 bases, can be listed as the partial nucleotide sequences. 

Moreover, the DNA described above are useful as antisense drug. The DNA, 
hybridizing to a DNA or an RNA encoding the AID protein of the present invention, can 
inhibit transcription of the DNA to mRNA or translation of the mRNA into the protein. 

In purpose of using above-mentioned DNA to antisense drug, the partial nucleotide 
sequence consists of 5 to 100 consecutive nucleotides, preferably 5 to 70 consecutive 
nucleotides, more preferably 5 to 50 consecutive nucleotides, and still more preferably 5 to 
30 consecutive nucleotides. 

When the DNA is used as an antisense DNA pharmaceutical, the DNA sequence can 
be modified chemically in part for extending the half-life (stability) of the blood 
concentration of the DNA administered to patients, for increasing the intracellular-membrane 
permeability of the DNA, or for increasing the degradation resistance or the absorption of the 
orally administered DNA in the digestive organs. The chemical modification includes, for 
example, the modification of the phosphate bonds, the riboses, the nucleotide bases, the sugar 
moiety, the 3' end and/or the 5' end in the structure of the oligonucleotide DNA. 

The modification of phosphate bond includes, for example, the conversion of one or 
more of the bonds to phosphodiester bonds (D-oligo), phosphorothioate bonds, 
phosphorodithioate bonds (S-oligo), methyl phosphonate (MP-oligo), phosphoroamidate 
bonds, non-phosphate bonds or methyl phosphonothioate bonds, or combinations thereof. 
The modification of the ribose includes, for example, the conversion to 2'-fluororibose or 
2'-0-methylribose. The modification of the nucleotide base includes, for example, the 
conversion to 5-propynyluracil or 2-aminoadenine. 

Also, another one of the present invention relates to "methods of identifying 
substances regulating the production of the AID protein of the present invention or the 
transcription of the gene encoding AID protein to mRNA". The method of the present 
invention is namely "the method of screening of drugs capable of regulating functions of 
AID protein or AID gene". 
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As the cells used in the method of the present invention, any cells, as long as capable 
of producing AID protein of the present invention, can be used. For instance, native cells 
(preferably of mouse or human), transgenic cells transformed with a gene encoding AID 
protein of the present invention, cells introduced with RNA encoding AID protein of the 
present invention, etc., can be listed. 

As the host cells used for preparing the transgenic cells, various cells, mentioned in 
the part explaining in detail on the method of expressing the protein of the present invention 
using the DNA of the protein described above, can be used. 

For instance, various cells such as naturally established cells or artificially established 
transgenic cells (e.g. bacteria {Escherichia, Bacillus), yeast {Saccharomyces, Pichia), animal 
cells and insect cells) can be exemplified. 

Preferably, animal cells, namely, cells derived from mouse (COP, L, CI 27, Sp2/0, 
NS-1, or NIH3T3, etc.), cells derived from rat (PC12, PC12h, etc.), cells derived from 
hamster (BHK, and CHO, etc.), cells derived from monkey (COS1, COS3, COS7, CV1, and 
Velo, etc.), and cells derived from human (Hela, cells derived from diploid fibroblast, 
HEK293 cells, myeloma cells, and Namalwa, etc.) can be exemplified. 

The "substance" in the present invention means natural substance existing in the 
nature and any substance prepared artificially. The substances can be grouped into "peptidic 
substance" and "non-peptidic substance". 

As the "non-peptidic substance", "DNA comprising partial nucleotide sequence, or 
chemically modified DNA derived from it" that are useful as antisense drug as described 
above, "antisense RNA" with similar structural and pharmacological property to the 
antisense DNA, or any chemically synthesized "compounds" can be exemplified. The 
"compounds" herein means compounds excepting DNA, RNA, and peptidic substances. 
Namely, compounds with molecular weight of smaller than from 100 to approximately 1000, 
preferably compounds with molecular weight of from 100 to 800, more preferably molecular 
weight of from 100 to 600, can be exemplified. 

As the "peptidic substance", antibodies already described above in detail (preferably 
monoclonal antibodies, more preferably recombinant antibodies or human monoclonal 
antibodies), oligopeptides, or chemically modified substance derived from them can be 
exemplified. Examples of an oligopeptide are a peptide comprising 5 to 30 amino acids, 
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preferably 5 to 20 amino acids. The chemical modification can be designed depending on 
various purposes, for example, the increased half-life in blood in the case of administering in 
vivo, or the increased tolerance against the degradation or increased absorption in digestive 
tract at the oral administration. 

Methods described in from (24) to (28) above includes so-called reporter gene assay, 
as one of the method of the present invention. 

As the "reporter protein", luciferase derived from firefly or sea pansy, or GFP derived 
fromjerryfish, are preferred. 

As the "reporter gene assay", methods described below are representative. 

Transgenic cells are generated by transforming cells commonly used in the 
production of recombinant proteins with expression vector, in which a gene encoding the 
target protein and a gene encoding the reporter protein are inserted to the vector so that the 
transcription of the gene encoding the reporter protein to mRNA occurs dependently on the 
signal of the transcription of the gene of target protein to mRNA. The test substances 
(described above) are applied to the obtained transformant cells. Analysis that whether the 
compound affects the expression of transporter molecule can be accomplished by measuring 
the level of the target protein by indirect measuring of the amount of fluorescence emitted by 
the reporter protein expressed in parallel with the target protein (for reference, see U.S. Pat. 
No. 5,436,128 and U.S. Pat. No. 5,401,629). 

Also, the identification of the compounds using the present assay can be 
accomplished by manual operation, but it can also be readily and simply done automatically 
by using so-called High-Throughput Screening using robots (SOSHIKI BAIYO KOUGAKU, 
23:521-524; U.S. Pat. No. 5,670,113). 

The "cells" and "substances" used in the methods described above contain the same 
meaning as defined above. 

The substances identified by the methods of the present invention are very useful as 
drug for therapy of various diseases considered to be caused by the hyperfunction or 
deficiency of the AID protein of the present invention or by the deficiency or mutation of the 
AID gene, or for remission of various symptoms supervene with the diseases. 
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DESCRIPTION OF DRAWINGS 

FIG. 1 is a is the photograph which shows the production state DNA including an Sa 
sequence looped out with the class switch recombination in mouse B cell clone CH12F3-2 
cultured under the various conditions. 

FIG. 1(a) shows the electrophoretic state of DNA including an Sa sequence looped 
out with class switch recombination, amplified by PCR using DNA derived from mouse B 
cell clone CH12F3-2 cultured under the various conditions. 

Lanes 1 and 6 show the electrophoretic state of marker DNAs. Lane 2 shows the 
electrophoretic state of PCR product using DNA from cells cultured in the condition 
excluding IL-4, CD40L, TGF P or cycloheximide, as a template. Lane 3 shows the 
electrophoretic state of DNA product using DNA from cells cultured in the presence of 
cycloheximide only, as a template. Lane 4 shows the electrophoretic state of PCR product 
using DNA from cells cultured in the presence of IL-4, CD40L and TGF p, as a template. 
Lane 5 shows the electrophoretic state of PCR product cultured in the presence of 11-4, 
CD40L, TGF p, and cycloheximide, as a template. 

FIG. 1(b) shows the result of Southern hybridization for DNA including an Sa 
sequence looped out with class switch recombination, amplified by PCR using DNA derived 
from mouse B cell clone CH12F3-2 cultured under the various conditions. 

Lane 1 shows the result of hybridization against PCR product using DNA from cells 
cultured in the condition excluding any of IL-4, CD40L, TGF p or cycloheximide, as a 
template. Lane 2 shows the result of Southern hybridization against PCR product using DNA 
from cells cultured in the presence of cycloheximide only, as a template. Lane 3 shows the 
result of hybridization against PCR product using DNA from cells cultured in the presence of 
IL-4, CD40L, and TGF p, as a template. Lane 4 shows the result of hybridization against 
PCR product using DNA from cells cultured in the presence of IL-4, CD40L, TGF p and 
cycloheximide, as a template. 

FIG. 2 is a photograph showing the production state of DNA including an Sa 
sequence looped put with class switch recombination, amplified by PCR using DNA derived 
from mouse B cell clone CH12F3-2 cultured in various conditions. 
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FIG. 2(a) shows the electrophoretic state of DNA from DNA including an Sa 
sequence looped out with class switch recombination in mouse B cell clone CH12F3-2 
cultured in the various conditions, stained with ethidium bromide. 

Lanes 1 and 6 show the electrophoretic state of marker DNAs. Lane 2 shows 
5 electrophoretic state of PCR product using DNA from cells cultured in the condition 
excluding IL-4, CD40L, TGF p or cycloheximide, as a template. Lane 3 shows the 
electrophoretic state of DNA product using DNA from cells cultured in the presence of 
cycloheximide only, as a template. Lane 4 shows the electrophoretic state for PCR product 
using DNA from cells cultured in the presence of IL-4, CD40L and TGF p, as a template. 
10 Lane 5 shows the electrophoretic state for PCR product cultured in the presence of 11-4, 
CD40L, TGF p , and cycloheximide, as a template. 

FIG. 1(b) shows the result of Southern hybridization for DNA including an Sa 
sequence looped out with class switch recombination, amplified by PCR using DNA derived 
*J3 from mouse B cell clone CH12F3-2 cultured under the various conditions, 

pi 15 Lane 1 shows the result of hybridization against a PCR product using DNA from cells 

J£J cultured under the condition excluding any one of IL-4, CD40L, TGF P or cycloheximide, as 

a template. Lane 2 shows the result of hybridization against a PCR product using DNA from 
cells cultured in the presence of cycloheximide only, as a template. Lane 3 shows the result 
of hybridization against a PCR product using the DNA from cells cultured in the presence of 
20 IL-4, CD40L, and TGF p, as a template. Lane 4 shows the result of hybridization against a 
PCR product using DNA from cells cultured in the presence of IL-4, CD40L, TGF p and 
cycloheximide, as a template. 

FIG. 3 shows the result of Northern blotting using a cDNA fragment coding a 
radiolabeled 23C9 (AID) protein, against mRNA derived from mouse B cell clone CH12F3-2 
25 cultured under the various conditions. 

Lane 1 shows the result of blotting against mRNA from cells cultured in the condition 
excluding any one of IL-4, CD40L, or TGF P or cycloheximide. Lane 2 shows the result of 
blotting against mRNA from cells cultured in the presence of cycloheximide, only. Lane 3 
shows the result of blotting against mRNA from cells cultured in the presence of IL-4, 
30 CD40L and TGF p. Lane 4 shows the result of blotting against mRNA cultured in the 
presence of IL-4, CD40L, TGF p, and cycloheximide. 
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FIG. 4 shows the result of Northern blotting using radio-labeled cDNA fragment 
coding 23C9 (AID) protein as a probe against mRNA derived from mouse B cell clone 
CH12F3-2 cultured in the various conditions. 

Lane 1 shows the result of blotting against mRNA from cells cultured in the condition 
5 excluding IL-4, CD40L, TGF P or cycloheximide. Lane 2 shows the result of blotting against 
mRNA from cells cultured in the presence of cycloheximide, only. Lane 3 shows the result of 
the blotting against mRNA form cells cultured in the presence of IL-4, CD40L, and TGF p. 
Lane 4 shows the result of the blotting against mRNA form cells cultured in the presence of 
IL-4, CD40L, TGF p and cycloheximide. 
1^^^2/^^^ 5 shows the homology between an amino acid sequence of mouse AID protein 
and that of Wise APOBEC-1 

An amino acid in a closed box shows an identical amino acid. A region in an open 
box indicates a cytidine deaminase motif. An amino acid with an asterisk (*) or an arrow 
indicates an amino acid conserved among APOBEC-1 proteins derived from rat, mouse, 
m 15 rabbit, and human. 

FIG. 6 shows a phylogenic tree of various enzymes belonging to a cytosine 
nucleoside / nucleotide deaminase family, prepared based on cytidine deaminase motif. 

FIG. 7 shows a photograph indicating the electrophoretic state for AID-GST fusion 
protein in the molecular weight analysis by the gel electrophoresis and silver staining 
20 method. 

Lane 1 shows the electrophoretic state for a marker molecule. Lane 2 shows the 
electrophoretic state for various proteins included in extracts from wild type Escherichia coli 
DH 5 a. Lane 3 shows the electrophoretic state for purified AID-GST fusion protein. 
FIG. 8 shows the electrophoretic state for AID-GST fusion protein by Western 
25 blotting using anti-AID protein peptide antibody. 

Lane 1 shows the electrophoretic state for various proteins included in the extract 
from wild type E. coli DH5a. 

Lane 2 shows the electrophoretic state for purified AID-GST protein. 
FIG. 9 shows a cytidine deaminase activity depending on the concentrations of AID 
30 proteins. 
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FIG. 10 shows the inhibitory effect of tetrahydrouridine which is an inhibitor specific 
to cytidine deaminase on a cytidine deaminase activity in AID protein. 

FIG. 1 1 shows the inhibitory effect of each of 1,10-o-phenanthrolime which is a zinc- 
chelating agent, and 1,7-ophenanthroline which is an inactivated isomer thereof on a 
cytidine deaminase activity in AID protein. 

FIG. 12 is a photograph indicating expression state for mRNA of AID in various 
tissues in mouse, analyzed by Northern blotting method. 

FIG. 13 is a photograph indicating the expression state for mRNA of AID in various 
lymphatic tissues in mouse, analyzed by RT-PCR method. 

FIG. 14 is the photograph showing expression state for mRNA of AID as time goes, 
in activated mouse B cell clone CH12F3-2, analyzed by Northern blotting method. 

FIG. 15 a photograph showing expression state for mRNA of AID in mouse B cell 
clone CH12F3-2 stimulated with cytokine in various combinations, analyzed by Northern 
blotting. 

FIG. 16 shows a photograph indicating expression state for mRNA of AID in mouse 
spleen B cells, stimulated with stimulants in various combinations, analyzed by Northern 
hybridization method. 

FIG. 17 is a photograph indicating expression state for mRNA of AID in splenocytes 
derived from mice immunized with sheep red blood cells, analyzed by Northern blotting 
analysis. 

FIG. 18 shows expression state for mRNA of AID in splenocytes derived from mice 
immunized with sheep red blood cells, analyzed by RT-PCR. 

FIG. 19 is a photograph indicating localization of expression for AID mRNA in 
splenocytes derived from a normal mouse or a mouse immunized by sheep red blood cells, 
specifically, analyzed by in situ hybridization. 

FIGs. 19 (A) and (D) indicate the result in the hybridization using a sense ATP probe. 
FIGs. 19 (B) and (E) show localization for AID mRNA expression in hybridization using an 
antisense-AID probe. FIGs. 19 (C) and (F) show localization of germinal center in staining 
test by FITC-labeled PNA. FIG.s 19 (A), (B), and (C) indicate the result in the test using 
spleen tissues derived from normal mouse (before the immunization of sheep red blood 
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cells). FIGs. 19 (D), (E), and (F) show the results of the examination using spleen tissue 
slices prepared 5 days after immunizing a mouse with sheep red blood cells. 

FIG. 20 is a photograph showing the localization of expression for AID mRNA in 
spleen tissue and payer's patch tissue, respectively, derived from a normal mouse or from a 
mouse immunized with sheep red blood cells, respectively, analyzed by in situ hybridization. 

FIGs. 20 (A), (D), and (G) show the results in the hybridization using a sense AID 
probe. FIGs. 20 (B), (E), and (H) show the localization of the expression for AID mRNA in 
hybridization using an antisense AID probe. FIGs. 20 (C), (F), and (I) show the localization 
of germinating center in the staining test by FITC-labeled PNA. FIGs. 20 (A), (B), and (C) 
show the result of the examination using spleen tissues derived from a normal mouse (before 
immunization by sheep red blood cells). FIGs. 20 (D), (E), and (F) show the results of the 
examination using spleen tissue slices prepared 5 days after immunization of a mouse with 
sheep red blood cells. FIGs. 20 (G), (H), and (I) show the results of test using payer's patch 
prepared 5 days after the immunization of a mouse with sheep red blood cells. 

FIG. 21 schematically shows relative locations of partial nucleotide sequences of 
human genomic DNA coding human AID protein, which was amplified by PCR using 
various pairs of primers. 

pP)^FIGS22 schematically shows a degree of homology between an amino acid sequence 
ofmouse AID protein and that of human AID protein. The parts with a closed box are 
cytidine and deoxWytidylate deaminase zinc-binding region which is an AID protein active 
region. 

FIG. 23 schematically shows the structure of human genomic DNA including a gene 
coding human ADD protein. One to five shows exon 1, exon 2, exon 3, exon 4, and exon 5, 
respectively. 

FIG. 24 is a photograph indicating the expression state for human AID mRNA in 
various types of human tissues, analyzed by RT-PCR. 

FIG. 25 is a photograph indicating a location (localization) of human AID gene on 
human chromosome, analyzed by Fluorescence in situ hybridization (FISH) method. 

Two points at the tips of arrows show 12pl3 where human AID gene exists.. 

Like reference symbols in the various drawings indicate like elements. 
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DETAILED DESCRIPTION OF THE INVENTION 

The invention is illustrated in details by the following Examples, but not restricted to 
embodiments described in the Examples. 

Example 1: Culture of for mouse B cell clone CH12F3-2 and confirmation of properties 

Mouse B cell clone CH12F3-2 occurring class switch recombination (CSR) from IgM 
to IgA, several hours after the stimulation by IL-4, TGF-p, and CD40L, previously isolated 
by the present inventors, was cultured in the same manner as in the previous report 
(Immunity, 9:1-10, 1998; Curr. Biol., 8:227-230, 1998; Int. Immunol, 8:193-201, 1996). 

When the CH12F3-2 is stimulated by IL-4, TGF-p, and CD40L, a circular DNA 
including an S region (switch region) looped out by class switch recombination was detected 
several hours after the stimulation. 

The following manipulation was conducted according to the previous report (Curr. 
Biol, 8:227-230, 1998). 

The B cell CH12F3-2 stimulated by IL-4, TGF-p, and CD40L, and that which was 
not stimulated were cultured for 6 hours in the presence or absence of cycloheximide 
(200 ng/ml) which is a protein synthesis inhibitor, respectively. Genomic DNA was extracted 
from each cell, and PCR was conducted with the DNA as a template by following the 
standard method to amplify circular DNA including an S|i sequence and an Sa sequence. 
PCR was conducted by using a pair of primers, aFl and )liR3 and the other PCR was 
conducted by using a pair of primers, aFl and jiR3. 

As a control, genomic DNA coding glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH) was amplified by PCR. 

PCR product was subjected to the gel electrophoresis by ethidium bromide staining. 
FIG. 1(a) and FIG. 2 show the results. 

To confirm the presence or absence of the amplification of a circular DNA including 
the looped-out S region, Southern hybridization was conducted against the PCR product by 
using mouse Sa region gene for a hybridization probe, according to the standard method (L. 
Sambrook E. F., Tom Maniatis., Second edition, Ed. Molecular Cloning (Nolan, C, Ed.) 
Cold Spring Harbor, 1989). As an Sa gene, a 1,155 bp DNA fragment obtained by digesting 
10 kb EcoRI cleaved fragment IgH703 with Hind III and Earl was used (Genbank #D1 1468, 
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DNA No. 1993-3148) (J, Biol. Chem., Vol. 268, p. 4651-4665). FIGs. 1(b) and 2(b) show the 
results. 

It has been shown that mouse B cell CH2F3-2 produces the looped-out DNA 
containing the Sot sequence with the class switch recombination by the stimulation with 
cytokine, and the production of the DNA is inhibited by the presence of cycloheximide. This 
result suggested that occurrence of class switch recombination of an immunoglobulin gene 
needs a novel synthesis of a protein in the very early stage after the stimulation and the 
protein is deeply involved in the induction of the class switch. 

Example 2: Identification of a gene which expression is improved in mouse B cell CH12F3-2 
stimulated by cytokine 

A gene which is presumably expressed in the early stage after mouse B cell clone 
CH12F3-2 is stimulated, and presumably play a role of introducing class switch 
recombination of an immunoglobulin gene was attempted to be isolate from the CH12F3-2 
cells by the suppression subtract hybridization (SSH) (Proc. Natl. Acad. Sci. USA, 93:6025- 
6030, 1996; Anal. Biochem., 240:90-97, 1996) using the inhibitory PCR effect (Nucleic 
Acids Res., 23:1087-1088, 1995). 

A cDNA library necessary for subtraction cloning was prepared by using PCR-Select 
Subtraction Kit (CLONTECH, Catalogue NO: Kl 804-1) by following the instruction manual 
supplemented with the kit in the experimental manipulation. 

PolyA + RNA was isolated from each of mouse B cell clone CH12F3-2 stimulated with 
IL-4, TGF-p and CD40L for 5 hours, the same cells stimulated with the cytokines for 
12 hours, and the cells which were not stimulated, by following the reported method (Nucleic 
Acids Res., 26:911-918, 1998) and treated with DNasel to eliminate genomic DNA which 
may be mixed. Then cDNA was prepared based on each polyA + RNA sample using reverse 
transcriptase according to the standard method. Each cDNA prepared from mouse B cell 
clone CH12F3-2, treated with the above cytokines for 5 or 12 hours was mixed with same 
mole amounts to be used as a tester cDNA. On the other hand, cDNA derived from 
unstimulated cells was used as a driver cDNA. 

Subtraction was conducted by adding the driver cDNA into the tester cDNA 
according to the above previous report and the experimental manipulation manual. The 
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efficiency of subtraction was monitored by adding a small amount (1:1000 mole ratio) of O 
XI 74 phage DNA cleaved at the restriction enzyme site Hae III, as a control, into the tester 
cDNA. After the subtraction, the phage DNA was concentrated to a mole ratio of about 
100 times. 

5 The subtracted cDNA was cloned to T-vector (Promega) according to the standard 

method to prepare a plasmid library. In the same manner as in the previous report, 
2000 colonies in the library were screened by the differential hybridization method (Nucleic 
Acids Res., 26:911-918, 1998; Medical immunity, 29:17, p. 451-459, 1997). Each of the 
above tester cDNA and driver cDNA was radiolabeled to be used for hybridization. Clones 
10 including O XI 74 phage DNA were selected by hybridizing 0 XI 74 phage DNA with a 
replicant filter. 

One hundred fifteen clones emitting a stronger signal than the radio-labeled driver 
P cDNA probe against radio- labeled tester cDNA probe were identified and a nucleotide 

^ sequence of each clone was determined by using a DNA sequencer. 

n\ 1 5 Northern blotting was conducted against mRNA obtained from mouse B cell clone 

fg CH12F3-2 stimulated with IL-4, TGF-p and CD40L or the same lines unstimulated, using 

^ the radio-labeled DNA inserted into the each clone as a probe, according to the standard 

3 method (L. Sambrook, E. F., Tom Maniatis., Second edition, Ed. Molecular Cloning (Nolan, 

~i C, Ed,), Cold Spring Harbour, 1989). As a result, the enhanced expression corresponding to 

20 the stimulation with the above cytokines was observed in 23 among 115 clones. Gene 

fragments coding 7 different types of proteins, including genes coding the 3 kinds of known 
proteins and 4 kinds of novel proteins were found to be inserted into the 23 clones. 
Specifically, the expression of the 7 kinds of genes were found to be enhanced in mouse B 
cell clone CH12F3-2 by the stimulation with IL-4, TGF-p and CD40L. 
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<The known proteins> 
ABCD-1/MDC (8 clones) 
IFNg receptor (2 clones) 
I-a (MHC class II) (1 clone) 



30 
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<Novel proteins> 
23C9 (3 clones) 
15B11 (7 clones) 
8B9 (1 clone) 
5 16A9(1 clone 



As it has been known that the expression of the above I-a gene and ABCD/MDC gene 
is enhanced by stimulating mouse spleen B cell with IL-4 and CD40L, it was confirmed that 
the subtraction cloning was effectively conducted (J. Exp. Med., 188:451-463, 1998; 
10 Immunity, 5:319-330, 1996) 



Example 3: Expression of mRNA for a novel protein 23C9 in mouse B cell clone CH12F3-2 
^ The degree of enhanced expression of gene coding a novel protein 23 C9 in mouse 

vO B cell clone CH12F3-2 stimulated with IL-4, TGF-P and CD40L was analyzed according to 

m 15 the standard method (L. Sambrook, E. F., Tom Maniatis., Second edition, Ed. Molecular 
JJ. 5 Cloning (Nolan, C, Ed.), Cold Spring Harbour, 1989) by Northern blotting. 

00 Mouse B cell clone CH12F3-2 was cultured in the presence of one of the following 

D 

a regents for 12 hours. 

yy 

ft 20 (1) IL-4, TGF-p and CD40L only. 

g (2) Cycloheximide which is a protein synthesis inhibitor (200 ng/ml), only 

M= (3) IL-4, TGF-P and CD40L as well as Cycloheximide (200 ng/ml) 

Northern blotting was conducted against mRNA (10 jag for each group) obtained in 
25 the same manner as the previous report (Nucleic Acid Res., 26:91 1-918, 1998) from each 
group of treated cells using a radio-labeled cDNA fragment (1,020 bp)coding a novel protein 
23C9, obtained in the above Example, according to the standard method (L. Sambrook, E. F., 
Tom Maniatis., Second edition, Ed. Molecular Cloning (Nolan, C, Ed.), Cold Spring 
Harbour, 1989). 

30 As a control examination, Northern blotting was conducted for mRNA derived from 

B cell clone CH12F3-2 cultured without any one of the above cytokines, or cycloheximide. 
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The amount of mRNA to be electrophoresed was adjusted using the amount of 
mRNA in glyceraldehyde-3-phosphate dehydrogenase (GAPDH) as an index. DNA 
amplified by RT-PCR using GF primer and GR primer was used as a probe for blotting of 
GAPDH mRNA (Location of nucleotides: 566-1016, Genbank U5299) (Immunity, 9:1-10, 
1998). 

FIGs. 3 and 4 show the results. 

The expression of mRNA for a novel protein 23C9 was extremely strong in mouse B 
cell clone CH12F3-2 stimulated with IL-4, TGF-p and CD40L, while the expression in 
unstimulated cells was extremely weak. Expression of the mRNA in the stimulated cells was 
inhibited by the presence of a protein synthesis inhibitor. Moreover, in the stimulated cells, 
two bands indicating the expression of mRNA comprising different nucleotide lengths were 
detected. 

Expression of mRNA for a novel protein 23C9 in each of the following mouse cell 
lines which do not originally comprise class switch recombination was examined by 
Northern blotting in the same manner as in the above. 

B cell lines (lyD9, BA/F3, 70Z/3, WEHI231), T cell lines (EL-4, 2B4), myeloma cell 
lines (X63, HEHI-3). Fibroblast lines (L929, NIH3T3,) the other cell lines (F2, P815, ST2). 

The expression of mRNA for the novel protein 23C9 was not observed in any cells. 

Example 4: Cloning of a full length cDNA coding a novel protein 23C9. 

Four different positive clones were obtained by screening cDNA library (Nucleic 
Acids Res., 26:911-918, 1998) prepared from mouse B cell clone CH12F3-2 stimulated with 
IL-4, TGF-p, and CD40L, using a cDNA fragment (1,020 bp) coding the novel protein 23C9, 
obtained in the above Example as a probe. A nucleotide sequence of each clone was 
determined by using a DNA sequencer according to the standard method. 

One clone comprises a 1.2 kb nucleotide length and a single reading frame (OPF) 
with 1 polyadenylation site. The other 3 clone comprise a 2.4 kb nucleotide length and 2 
polyadenylation sites. A nucleotide sequence in the 1.2 kb part at the 5' side in the latter 
clones was identical to that of the nucleotides in the 1 .2 kb DNA in the former (SEQ ID 
NO:l). 
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Two different mRNA transcripts detected in Northern blotting in the above Example 
(FIGs. 3 and 4) were predicted to correspond to transcripts for each of the above 1.2 kb and 
2.4 kb, transcribed using the polyA site at 3' end and the polyA site at the 5'end. 

A cDNA fragment coding the novel protein 23C9 used as a probe in the above 
(1,020 bp) was found to be a nucleotide sequence of from 847 to 1866 in the full length 
cDNA of23C9. 

A nucleotide sequence near a first initiation codon in each cDNA was fit to Kozak's 
rule (Nucleic Acids Res., 15:8125-8148, 1987). In the 2.4kb cDNA, ATTTA which is a motif 
capable of mediating quick degradation of mRNA (Blood, 83:3182-3187, 1994) was present 
in the 2 sites in the untranslational region in the 3 'side. 

An open reading frame (ORF) of cDNA coding the novel protein 23C9 consisted of 
198 amino acids with the expected molecular weight of about 24kDa (SEQ ID NO:2). As a 
result of homology searching with known proteins by database, an amino acid sequence of 
ORF of the novel protein 23C9 comprised 34% amino acid homology with apolipoprotein B 
mRNA editing enzyme, catalytic polypeptide-1 (APOBEC-1) (Science, 260:1816-1819, 
1993; J. Biol. Chem., 268:20709-20712. 1993). GenBank and EMBL were used as DNA data 
base. SwissPlot was used as protein database. BLAST program (J. Mol. Biol., 215:403-410, 
1990) and FASTA program (Proc. Natl. Acad. Sci. USA., 85:2444-2448 1988) were used for 
database search. 

FIG. 5 shows an amino acid sequence of ORF of the novel protein 23 C9 and an 
alignment between the sequence and that of mouse APOBEC-1 amino acid sequence. 

As a result of motif search on online using PROSITE (Nucleic Acids Res., 1 1 :2013- 
2018, 1992), the APOBEC-1 like novel protein 23C9 comprises cytidine/deoxycytidine 
deaminase motif which is conserved in a amino acid sequence of a protein belonging to 
cytosine nucleoside/ nucleotide deaminase family which constructs a large family and is an 
activation site of a deaminase activity. A cytosine nucleoside/nucleotide deaminase family is 
classified into RNA editing deaminase, cytidine/deoxycytidylate deaminase, and CMP/dCMP 
deaminase based on the substrate specificity and homology in the activation sites (Cell, 
81:187-195, 1995). 

A phylogenic tree was prepared based on the alignment among regions in APOBEC-1 
which is an RNA editing deaminase, cytosine nucleoside deaminase, cytosine nucleotide 
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deaminase, and cytidine deaminase motif in the novel protein 23C9. The sequences in the 
known proteins used for the comparison were obtained from GenBank, as follows. 

Human derived nucleoside deaminase: L27943 

Mouse derived nucleoside deaminase: AA388666 

S. subtilis derived nucleoside deaminase: U18532 

E. coli derived cytidine deaminase: X63144 

Rabbit derived APOBEC-1: U10695 

Human derived APOBEC-1: L25877 

Rat derived APOBEC-1 : U10695 

Mouse derived APOBEC-1: U21951 

T2/T4 phage derived nucleotide deaminase: JOS 172 

Human derived nucleotide deaminase: LI 2 136 

5. cerevisiae derived nucleotide deaminase: U10397 

FIG. 6 shows the result. Cytidine deaminase motif in the novel protein 23 C9 was 
rather relative to a subgroup of RNA editing deaminase than subgroups of nucleoside 
deaminase and nucleotide deaminase. 

On the other hand, a leucine-rich region existing at the C-terminus of APOBEC-1 is 
thought to be important for protein-protein interaction (Proc. Natl. Acad. Sci. USA., 91:8522- 
8526, 1994; J. Biol. Chem., 269:21725-21734, 1994). The novel protein 23C9 also 
comprised a leucine-rich region at the C-terminus. Four leucines in the region o 23 C9 were 
conserved in the leucine rich regions of APOBEC-1 in rabbit, rat, mouse and human. 

It has been known that Phe66, Phe87, His61, Glu63 and Cys93 are essential for 
binding of APOBEC-1 to RNA, and all these amino acid residues were conserved in the 
primary structure of 23C9 (Trends Genet., 12:418-424, 1996; Cell, 81:187-195, 1995; J. Biol. 
Chem., 270:14768-14775 1995; J. Biol. Chem., 270:14762-14767, 1995). From this fact, 
23C9 protein is predicted to comprise an RNA editing deaminase activity. 

Moreover, cytidine deaminase derived from APOBEC-1 and E. coli (ECCDA) are 
known to comprise a pseudoactive site domain at the C-terminus and the 23C9 protein also 
comprised a pseudoactive site domain same as in the APOBEC-1. This indicates that 23C9 
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protein is more relative to APOBEOl and ECCDA than deaminase proteins in the other 
groups. 

From these facts, the novel protein 23C9 was named activation-induced cytidine 
deaminase (AID). The novel protein 23C9 was called AID hereafter. 

5 

Example 5: Preparation of the AID-GST fusion protein 

The cDNA coding a full length AID cloned in the above Example was amplified by 
PCR with a pair of primers, AID- 138 (SEQ ID NO:3) and AID- 161 (SEQ ID NO:4), a pair of 
primers, AID-1 18 (SEQ ID NO:5) and AID-1 19 (SEQ ID NO:6) and Taq Polymerase by 
10 following the standard method. As there is an intron between AID-1 18 and AID-1 19, a PCR 
product derived from AID genomic DNA can be easily distinguished. 

The obtained PCR product was subcloned to pGEX4Tl vector (Pharmacia) according 
n to the standard method. A nucleotide sequence of the vector was determined and the absence 

of point mutation derived from the use of Taq polymerase in the full length AID cDNA 
On 1 5 cloned to the vector was confirmed. 

~ E. coli DH5a was transformed with the vector according to the standard method. The 

2 obtained transformants were cultured, and a full length AID cDNA was expressed as a fusion 

g protein with glutathione S-transferase (GST). The AID-GST fusion protein was extracted in 

^ the same manner as in the previous report, and purified using glutathione agarose affinity 

fU 20 chromatography (J. Biol. Chem., 270:14768-14775 1995). 

to 

p A molecular weight of the purified AID-GST fusion protein was analyzed by 

^ following the standard method using 10% SDS-PAGE and silver staining. A protein 

extracted from wild type E. coli DH5a was used as a control. FIG. 7 shows the result. 

As expected, the fusion protein was detected as a band comprising a molecular weight 
25 of about 49 kDa. Minor bands detected under the about 49 kDa were thought to be 
decomposed proteins, frequently generated in the purification process in general. 

A molecular weight of the purified AID-GST fusion protein was analyzed by the 
Western blotting method according to the standard method (Genomics, 54:89-98, 1998). 
Anti-AID protein antibody to be used for the assay was prepared by immunizing a 
30 commercial rabbit for the experiment with multiple antigen peptides including synthetic 
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peptides corresponding to amino acid NO: 1 16 to 132 of the AID protein of the present 
invention (Proc. Natl. Acad. Sci. USA., 85:5409, 1988). 
FIG. 8 shows the result. 



5 Example 6: Cytidine deaminase activity of the AID protein 

A cytidine deaminase activity of AID was measured by the same method as in the 
previous report (J Biol. Chem. 270:14768-14775, 1995). 

The purified AID-GST fusion protein prepared in the above (2, 4, 6, 8, 10, 20, 40, 60, 
100, 200, 300, 400, and 600 ng) was incubated in the buffer (pH 7.5, the total amount 10 jal) 
10 containing 45mM Tris with 3.3 \iCi [ 3 H] deoxycytidine (24.8 Ci/mmol, DuPont) and 250 ^iM 
cytidine for 2 to 4 hours. The reaction was terminated by adding deoxycytidine (2 jal of 
10 ^ig/ml) and deoxyuridine (2 jul of 10 jxg/ml). Insoluble substances were removed by 
B centrifiigation, and the reaction mixture (4 j^l) was subjected to the polyethylene imine- 

cellulose thin layer chromatography plate (VWR). The plate was developed in isopropyl 
m 15 alcohol / 10% HC1 (7:2 v/v). The plate was exposed to ultraviolet light (254 nm) for 
^ visualization and bands corresponding to deoxycytidine and deoxyuridine were collected, 

ffi added to Ultima Gold scintillation solution to be quantified by liquid scintillation photometer 

O 

g (Packard) 

O 

^ FIG. 9 shows the result. As a result, AID protein showed the cytidine deaminase 

ry 20 activity depending on the concentrations. 

g An inhibitory effect of tetrahydrouridine (THU; 0 to 40 \M) (Calbiochem, USA) 

^ which is an inhibitor specific to cytidine deaminase, on the cytidine deaminase activity in the 

AID-GST fusion protein (300 ng) was measured by the same method described above. 
FIG. 10 shows the result. The cytidine deaminase activity of AID protein was 
25 inhibited dependently on the concentrations of THU. 

Each inhibitory effect of 1,10-o-phenanthroline (0 to 20 mM) which is a zinc- 
chelating agent and its inactive form isomer 1 ,7-o-phenanthroline (0 to 20 mM) on the 
cytidine deaminase activity in the AID-GST fusion protein was measured in the same manner 
as described below. 

30 FIG. 1 1 shows the result. The cytidine deaminase activity of AID protein was 

inhibited by 20 mM 1,10-o-phenanthroline by about 91%. 1,7-o-phenanthroline which is the 
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inactive isomer only inhibited about 13%. These results indicate that AID protein is a zinc- 
dependent cytidine deaminase, same as APOBC-1. 

Example 7: Avidity of AID protein with AU-rich RNA 
5 A recombinant APOBEC-1 binds to Au-rich RNA (Trends Genet., 12:418-424, 1996; 

Cell, 81:187-195, 1995; J. Biol. Chem., 270:14768-14775, 1995; J. Biol. Chem., 270:14762- 
14767, 1995), and progresses RNA editing for apoB in the presence of chicken extract 
including co-factor. 

Since the AID protein has a functional cytidine deaminase activity as well as a 
10 structural similarity with APOBEC-1, to examine an RNA editing activity in the AID 
protein, avidity to AU-rich RNA (5-AU) and apoB RNA which are RNA substrate for 
APOBEC-1 was examined. 

The AID protein did not show avidity to AU-rich RNA (5-AU) in the gel retardation 
%y assay. In in vitro apoB RNA assay, conversion from cytidine (C) to uridine (U) was not 

m 15 observed. 

OS Example 8: Expression distribution of AID mRNA in tissues 

7 The expression of AID mRNA in each tissue was examined by Northern blotting 

™ according to the standard method (L. Sambrook, E. F., Tom Maniatis., Second edition, Ed. 

fU 20 Molecular Cloning (Nolan C, Ed.), Cold Spring Harbour, 1989; Experimental Medicine, 
J Suppl,., "Genetic Engineering Hand Book", published by Yodosha, p. 133-140, 1992). 

PolyA + RNA (2 ^g each) obtained from cells derived from each tissue in mice 
(muscle, spleen, lung, heart, lymph node, brain, kidney, thymus, testis, liver) according to the 
previous report (Nucleic Acids Res., 26:91 1-918, 1998) was used as a sample. Radiolabeled 
25 cDNA fragment (1,020 bp) coding AID (23C9) obtained in the previous Examples was used 
as a probe for blotting polyA + RNA. 

As a control, mRNA of glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) was 
blotted in the same manner. AS a probe for blotting GAPDH mRNA, DNA amplified by 
PCR using GP primer and GR primer was used. (Nucleotide location: 566-1016, Genbank, 
30 U52599) (Immunity, 9:1-10, 1988). 

FIG. 12 shows the result. 
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As a result, AID mRNA was strongly expressed in mesenteric lymph node. In 
addition, weak expression was observed in spleen. 

Example 9: Expression of AID mRNA in various lymphatic tissues. 
5 The expression of AID mRNA in each lymphatic tissue was analyzed by RT-PCR 

according to the standard method (Immunity, 9:1-10, 1998). 

cDNA was prepared according to the standard method using polyA + RNA obtained 
from cells derived from various lymphatic tissues (Payer's patch, mesenteric lymph node, 
axillary lymph node, spleen, bone marrow, thymus) in the same manner as in the previous 
10 report (Nucleic Acids Res., 26:91 1-918, 1998), for mRNA as a sample, as a template. AID 
cDNA and GAPDH cDNA were amplified using the obtained cDNA as a template. The pair 
of primers, AID-138 (SEQ ID NO:3) and AID-161 (SEQ ID NO. 4) in the above, a pair of 
a primers AID-1 18 (SEQ ID NO:5) and AID-1 19 (SEQ ID NO:6) and Taq polymerase were 

tfj used for PCR of AID cDNA. As there is an intron between AID-1 18 and AID-1 19, a PCR 

fp 15 product derived from the AID genomic DNA sequence can be easily distinguished. 
^ FIG. 13 shows the result. 

W AID cDNA was detected in all lymphatic tissues except for thymus. In particular, the 

O 

s obvious expression was observed in peripheral lymphatic organs, such as lymph node or 

g Payer's patch. On the other hand, the expression in primary lymphatic organs was weak in 

ry 20 comparison with that in the peripheral lymphatic organs. 

CD 

o 

H= Example 10: Expression of AID mRNA as time goes in activated mouse B cell clone 

CH12F3-2 

Expression of AID mRNA as time goes in activated mouse B cell clone CH12F3-2 
25 stimulated with IL-4, TGF-(3, and CD40L for 0 to 60 hours was analyzed by Northern 
blotting according to the standard method (L. Sambrook, E. F., Tom Maniatis., Second 
edition Ed. Molecular Cloning (Nolan, C, Ed.). Cold Spring Harbor, 1989). 

Mouse B cell clone CH12F3-2 was cultured in the presence of IL-4, TGF-p, and 
CD40L for the various periods (0, 3, 5, 12, 24, 36 48 or 60 hours). 
30 Northern blotting was conducted against mRNA (10 |ig in each group) obtained from 

each culture group in the same manner as in the previous report (Nucleic Acids Res., 26:91 1- 
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918, 1998) using a radiolabeled cDNA fragment coding AID (23C9) obtained in the previous 
Examples, as a probe, according to the standard method (L. Sambrook, E. R, Tom Maniatis., 
Second edition Ed. Molecular Cloning (Nolan, C, Ed.). Cold Spring Harbor, 1989). 

The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA of 
GAPDH as an index. DNA amplified by RT-PCR using GF primer and GR primer was used 
as a probe for blotting GAPDH mRNA (Nucleotide location: 566-1016, Genbank U52599) 
(Immunity, 9:1-10, 1998). 

FIG. 14 shows the result. 

It was shown that The expression of AID mRNA in mouse B cell clone CH12F3-2 
was too small to be detected without the stimulation by cytokines, but that the expression was 
initiated 3 hours after the stimulation by cytokines (described in the above), was maximum 
12 hours after the stimulation (more than about 15 times), and was gradually decreased from 
48 hours after the stimulation. 

Example 1 1 : Cytokine specificity to inducing expression of AID mRNA in mouse B cell 
clone CH12F3-2. 

Cytokine specificity to inducing expression of AID mRNA in mouse B cell clone 
CH12F3-2 was analyzed by Northern blotting according to the standard method L. 
Sambrook, E. F., Tom Maniatis., Second edition Ed. Molecular Cloning (Nolan, C, Ed.), 
Cold Spring Harbour, 1989). 

Mouse B cell clone CH12F3-2 was cultured in the presence of various combinations 
of cytokines (one or more selected from IL-4 TGF-p, CD40-L) for 12 hours. Northern 
blotting was conducted against mRNA (10 ^ig in each group) obtained from each culture 
group in the same manner as in the previous report (Nucleic Acids Res., 26:91 1-918, 1998) 
using a radio-labeled cDNA fragment (1,020 bp) coding AID (23C9) obtained in the previous 
Example, according to the standard method (L. Sambrook, E. F., Tom Maniatis., Second 
edition Ed. Molecular Cloning (Nolan, C, Ed.). Cold Spring Harbor, 1989). 

The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA of 
GAPDH as an index. DNA amplified by RT-PCR using GF primer and GR primer was used 
as a probe for blotting GAPDH mRNA (Nucleotide location: 566-1016, Genbank U52599) 
(Immunity, 9:1-10, 1998). 
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FIG. 15 shows the result. 

Expression induction of AID-mRNA was small by solely any one kind of cytokines. 
On the other hand, when 3 kinds of cytokines described above were used at the same time, 
the maximum expression induction of AID-mRNA was observed. 

As described in the above Example 3, because expression induction of AID mRNA 
was inhibited by cycloheximide which is an inhibitor for protein synthesis, it is hypothesized 
that enhanced expression of AID mRNA needs de novo synthesis. 

Example 12: Expression induction of AID mRNA in spleen B cell by stimulation 

The presence or absence of expression induction of AID mRNA by stimulation which 
may activate B cell and induce class switch recombination of immunoglobulin was 
examined. 

Spleen B cell was purified and obtained from BALB/c mouse (6 to 12-week old, 
Shimizu Experimental Materials (SLC)) according to the standard method. Dead cells and 
cell fragments were removed by Ficoll density gradient centrifugation after the process of 
removing T cells. The purified spleen B cell was cultured for 4 days in the presence of a 
stimulus in various combinations (one or more selected from IL-4, TGF-p, CD40L and LPS 
(lipopolysaccharide) in the same manner as in the previous report (Nucleic Acids Res., 
26:91 1-918, 1998). LPS derived from Salmonella typhosa (50 ng/ml, Sigma) was used. 

Northern blotting was conducted against mRNA (15 jig in each group) obtained from 
each culture group in the same manner as in the previous report (Nucleic Acids Res., 26:91 1- 
918, 1998) using a radio-labeled cDNA fragment coding AID (23C9) obtained in the 
previous Example, according to the standard method (L. Sambrook, E. F., Tom Maniatis., 
Second edition Ed. Molecular Cloning (Nolan, C, Ed.). Cold Spring Harbor, 1989). 

The amount of mRNA to be gel-electrophoresed was adjusted by using mRNA of 
GAPDH and 28S ribosomal RNA as an index. DNA amplified by RT-PCR using GF primer 
and GR primer was used as a probe for blotting GAPDH mRNA (Nucleotide location: 566- 
1016, Genbank U52599) (Immunity, 9:1-10, 1998). 

FIG. 16 shows the result. 

The enhanced expression of AID mRNA by the stimulation with LPS only or 
LPS+IL-4, or LPS+TGF-P was observed in normal mouse spleen B cells. 
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Example 13: Induced expression of AID mRNA in vivo 

It was examined whether the expression induction of AID mRNA by various 
stimulation in vitro would occur in vivo. 

BALB/c mouse (6 to 12-week old, five individuals in each group, SLC) was 
immunized by intraperitoneally administering sheep red blood cell (SRBC) (1 X 10 8 cells, 
Cosmo Bio.). In the living body immunized by SRBC, it has been known that clonal 
expansion and germinal center formation occur after immunoresponse, and class switch 
recombination of an immunoglobulin gene and affinity maturation are caused. 

PolyA + RNA was prepared from splenocytes isolated from spleen excised from each 
individual before (day 0) and after (day 2, 5 and 13) the immunization. 

The PolyA + RNA (2 jxg each) was subjected to the Northern blotting using the 
radiolabeled cDNA fragment (1,020 bp) coding AID (23C9) as a probe in the same manner 
as the above Examples. The amount of mRNA to be gel-electrophoresed was adjusted using 
mRNA of GAPDH as an index in the same manner as in the above Examples. 

FIG. 17 shows the result. 

The minimum amount of expressed AID mRNA was detected before immunization of 
SRBC (day 0), however, a significant enhancement of expression (about 4 to 5 times) was 
observed day 5 and day 13 after the immunization. 

Moreover, to analyze in which cell type enhanced expression of AID mRNA occurs, 
RT-PCR was conducted by the standard method (Immunity, 9:1-10, 1998). 

Red blood cells were removed from splenocytes obtained from spleen which was 
obtained 5 days after the immunization of SRBC in the same manner as the above, and 
T cells and non- T cells were separated using nylon fiber (Wako Pure Chemicals) in the same 
manner as in the previous report (Eur. J. Immunol., 3:645-649, 1973). T cell fraction 
contained more than 90% of CD3 positive cells, and less than 20% B 220 positive cells. 
T-cell fraction (removal of B cells) and B-cell fraction were concentrated by MACS method 
with magnetic beads conjugated to anti-CD19 antibody (Miltenyi Biotech.). B220 positive 
B cells included in the fraction in which T cells were removed were 5% or less. On the other 
hand, B220 positive B cells included in the fraction in which CD 19 positive cells were 
concentrated were 60% or more. 
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cDNA was prepared by reverse transcriptase according to the standard method using 
polyA + RNA prepared from each fractionated cell group. AID cDNA and GAPDH cDNA 
were amplified by PCR using the obtained cDNA as a template. For PCR of AID cDNA, the 
previously described pair of primers, AID-138 (SEQ ID NO:3) and AID-161 (SEQ ID 
5 NO:4), and the previously described pair of primers, AID-1 18 (SEQ ID NO:5) and AID-1 19 
(SEQ ID NO: 6), as well as Taq polymerase were used. 

FIG. 18 shows the result. 

As a result, in the CD 19 positive B cell fraction and non-T cell fraction, amplification 
of AID cDNA was observed. Specifically, it was demonstrated that enhanced expression of 
10 AID mRNA induced by immunization by SRBC occurs in spleen CD 19 positive B cells. 
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Example 14: Localization of AID mRNA expression in lymphatic organs 
m It was found that timing of enhanced expression of AID mRNA in spleen almost 

C 5 consistent with the initiation of germinal center (GC) formation after immunization of SRBC, 

in 1 5 from the result of the previous Examples. In this examination, an precise localization of AID 
mRNA expression in lymphatic organs was analyzed using in situ hybridization. 

AID cDNA cleaved out by digesting pGEX4Tl vector in which cDNA coding AID 
protein has been subcloned, with EcoRI and Xhol was subcloned into plasmid pBluesciptSK 
Q (+) (Stratagene). The plasmid was digested with EcoRI or Xhol to obtain linearized plasmid 

fy 20 DNA and transcribed into RNA using the plasmid as a template in the presence of 
S digoxigenin-labeled rUTP (Boehringer-Mannheim) using T3 RNA polymerase or T7 RNA 

H= polymerase to prepare each of digoxigenin-labeled antisense probe and sense probe. 

On the other hand, frozen tissue slices were prepared by immobilizing with 
paraformaldehyde from each of spleen and payer's patch in a normal mouse as a lymphatic 
25 organ sample. A normal mouse was immunized with SRBC in the same manner as in the 
above Examples, and frozen tissue slices immobilized with paraformaldehyde from spleen 
obtained 5 day after the immunization. 

Hybridization was conducted by applying the digoxigenin-labeled antisense AID 
probe or sense AID probe to each of the slides furnished with each of immobilized slices. 
30 Hybridized digoxigenin-labeled AID probe was detected using anti-digoxigenin antibody 
conjugated with alkaline phosphatase. The localization of anti-digoxigenin antibody 
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conjugated to digoxigenin on the probe was identified by detecting a phosphatase reactant 
(dark purple color). This analysis was conducted using a light transmission microscope. 

In situ hybridization and detection of riboprobe in this examination were conducted in 
the same manner as in the previous report (J. Comp. Neurol., 333:398-416, 1993). 

The location of germinal center in each tissue slice was identified by staining with 
PNA (Vector) conjugate with FITC and observing with a inflorescent microscope. 

FIGs.19 and 20 show the result. 

In the examination using the antisense AID probe, multiple obvious focal signals 
were observed in spleen tissue slices derived from SRBC immunized mouse (day 5 after the 
immunization) (FIGs. 19(E) and 20(E)), however, any signals were not detected in spleen 
tissue slices derived from mice which were not immunized with SRBC (FIGs. 19(B) and 
20(B)). This result is consistent with the result of Northern blotting obtained in the above 
Example (FIG. 17). Existence of germinal center was observed both in spleen tissue slices 
derived from SRBC-immunized mouse (day 5 after the immunization) (FIGs. 19(F) and 
20(F)) as well as in the normal payer's patch (FIG. 20(1)), by staining with FITC-labeled 
PNA. The expression of AID mRNA was found to localize in germinal center in the both 
tissue slices. 

In the examination using sense AID probe, any signals as background were not 
detected in tissue slices of spleen or of payer's patch regardless of presence or absence of the 
immunization by SRBC. 

This result indicates that expression induction of AID mRNA occurs in specifically to 
germinal center B cells, activated by stimulation with an antigen. 

Example 15: Isolation of genomic DNA coding AID protein derived from human 
15-1 Preparation of probes for hybridization 

Q$ J7PCR w^s conducted using an expression vector prepared by inserting cDNA coding a 
ftlfl length mouseSAID protein, prepared in Example 5 into a plasmid vector pGEX4Tl, as a 
template with a pair\f primers (Primer 170: SEQ ID NO: 16 and primer 181: SEQ ID 
NO: 1 79, according to the standard method described in the above. 

The obtained PCR product was purified by the standard method described above and 
a nucleotide sequence of the purified DNA was determined by the direct sequence method to 
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confirm that the purified DNA is the nucleotide sequence coding a full length of mouse AID 
protein. This purified DNA was used as a probe for hybridization in the following 
experiments. 



5 15-2 Screening of human genomic DNA library 

The probe prepared in the above was labeled in the same manner as for the 
radioactive probe in the above Northern hybridization to make a probe radio-labeled by a 
radioactive isomer. 

Using the labeled probe, a commercial human genomic DNA library (catalogue No. 
10 HL1067j; Lot No. 45003; CLONTECH) was screened by the cross hybridization according 
the standard method. 

Washing after the hybridization was conducted twice in 2 X SSC (including 0.1% 
SDS, under the room temperature, 10 min), and twice in 2 X SSC (including 0.1% SDS, 
65°C, 30 min). Phage DNA was subcloned by purifying phage DNA and inserting about 22 
15 kb genomic DNA obtained by cleaving at NotI restriction enzyme site in the phage DNA, 
into Not I restriction enzyme site in plasmid pZero-2.1. This plasmid was named 3CpZero. 

A DNA fragment obtained by digesting 3CPZero with PstI was ligated to the PstI site 
of plasmid pBlueScript KS (Toyobo) and E. coli was transformed with this ligated DNA. 
Transformants were screened by the colony hybridization using the labeled probe 
20 prepared in the above according to the standard method, and multiple positive clones were 
obtained. 

A nucleotide sequence of human genomic DNA inserted into each positive clone was 
analyzed and multiple clones containing genomic DNA of DNA coding human AID protein 
were identified. 

25 Among the multiple clones, nucleotide sequences of genomic DNA containing DNA 

coding human AID protein contained in two clones were described in SEQ ID NOs:9 and 10, 
respectively. 

Moreover, a nucleotide sequence of genomic DNA including DNA coding human 
AID protein included in the positive other clone was shown in SEQ ED NO:35. 
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Example 16: Isolation of cDNA coding a full-length human AID protein and preparation of 
human AID protein 

By comparing a nucleotide sequence of genomic DNA including a coding region of 
the obtained human AID protein with cDNA nucleotide sequence coding a full-length mouse 
AID protein determined in the above, a human AID protein coding region in the human 
genomic DNA was deduced. 

A pair of primers for RACE-PCR was designed based on the deduced nucleotide 
sequence of the coding region in the human AID protein (Primer 22: SEQ ID NO: 18, and 
primer 25: SEQ ID NO: 19). 

RACE-PCR was conducted using mRNA prepared from human B Lymphoma cell 
line RAMOS as a template with the above pair of primers according to the previous report 
(J. Biol. Chem., 274:18470-18476, 1999) by following the standard method. A nucleotide 
sequence of the obtained PCR product was determined and cDNA coding a full length human 
AID protein was obtained (cDNA sequence: SEQ ID NO:7, and amino acid sequence: SEQ 
IDNO:8). 

<y JA S a rd^ult, human AID protein (SEQ ID NO:8) has extremely high homology in 
no acid sequences with a mouse AID protein (SEQ ID NO:2) (FIG. 22). Amino acid 
sequences in Cytiaine and deoxycytidilate deaminase zinc-binding region which is an active 
region in AID protein\(both mouse AID and human AID SEQ ID NO:56 to 94) were 
completely consistent (conserved) between mouse and human. 

As a partial amino acid sequence (amino acid NO: 1 16 to 132 in SEQ ID NO:2) of 
mouse AID protein used for the preparation of anti-AID protein antibody (Example 5) was 
completely consistent with a corresponding amino acid sequence (amino acid NO: 1 16 to 132 
in SEQ ID NO: 8) of human AID protein, the anti-AID protein antibody was expected to 
comprise cross-reactivity not only with mouse AID protein but also with human AID protein. 

Human AID cDNA obtained in the above was reconstructed according to the standard 
method in the manner of genetic engineering so that His-AID fusion protein added with His- 
tag (a peptide of histidine repeated 10 times) at N-terminal in the human AID protein was 
produced, and an expression vector was prepared by inserting the cDNA into a plasmid pEF- 
BOS (Unexamined published Japanese patent No. Hei 2-242687). The vector was introduced 
into monkey kidney derived cell line COS7 by lipofection using LIPOFECTAMINE (GIBCO 
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BRL) according to the standard method. The obtained transgenic cells were cultured by the 
standard method and His-human AID fusion protein was transiently expressed. His-human 
AID fusion protein was extracted and purified in the same method as the previous report, and 
the production of His-human AID fusion protein was analyzed by Western blotting with the 
anti^AIDjintibody prepared in Example 5 and a commercial anti-His tag antibody according 
to the standard method. As a result, the His-AID protein was detected as a band comprising 
about 3 1 kDa molecular weight in all cases using any antibody. 

Example 17: Determination of exons in genomic DNA coding human AID protein 

Based on the information for the nucleotide sequence o cDNA coding the full length 
human AID protein above, exons in the nucleotide sequences for genomic DNA coding 
human AID protein in the above were determined. 

As a result, it was confirmed to consist of 5 exons. 

Exon 1 (Nucleotide sequence: SEQ ID NO: 11); 
Exon 2 (Nucleotide sequence: SEQ ID NO: 12); 
Exon 3 (Nucleotide sequence: SEQ ID NO: 13); 
Exon 4 (Nucleotide sequence: SEQ ID NO: 14); and 
Exon 5 (Nucleotide sequence: SEQ ID NO: 15). 
The exon 1 contains a translation initiation codon ATG which codes the first 
methionine (Amino acid No: 1 of SEQ ID NO:8) in human AID protein, and the initiation 
codon corresponds to nucleotide NOs: 80 to 82 in SEQ NO: 1 1 . 

Specifically, the genomic DNA including DNA coding human AID obtained in the 
above Examples (SEQ ID NO:9, SEQ ID NO: 10 and SEQ ID NO:35) consists of introns and 
exons described below and comprises a full length of about 1 1 kb. FIG. 23 schematically 
shows the structure. 

<SEQ ID NO:9> 

Intron: Nucleotide NOs: from 1 to 1031 
Exon 1: Nucleotide NOs: from 1032 to 1118 
Intron: Nucleotide NOs: from 1119 to 5514 
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<SEQIDNO:10> 

Intron: Nucleotide NOs: from 1 to 1064 
Exon 2: Nucleotide NOs: from 1065 to 1212 
Intron: Nucleotide NOs: from 1213 to 2591 

5 Exon 3 : Nucleotide NOs: from 2592 to 2862 

Intron: Nucleotide NOs: from 2863 to 3155 
Exon 4: Nucleotide NOs: from 3156 to 3271 
Intron: Nucleotide NOs: from 3272 to 3740 
Exon 5: Nucleotide NOs: from 3741 to 5912 

10 Intron: Nucleotide NOs: from 591 3 to 6564 



<SEQ ID NO:35> 
q Intron: Nucleotide NOs: from 1 to 441 

■A Exon 1 : Nucleotide NOs: from 442 to 528 

(p 15 Intron: Nucleotide NOs: from 529 to 6279 

|j Exon 2: Nucleotide NOs: from 6280 to 6427 

63 Intron: Nucleotide NOs: from 6428 to 7806 

r Exon 3 : Nucleotide NOs: from 7807 to 8077 

5 Intron: Nucleotide NOs: from 8078 to 8370 

FU 20 Exon 4: Nucleotide NOs: from 8371 to 8486 

□ Intron: Nucleotide NOs: from 8487 to 8955 

^ Exon 5 : Nucleotide NOs: from 8956 to 1 1 067 

Intron: Nucleotide NOs: from 11068 to 11204 



25 Example 1 8: Amplification of a given partial nucleotide sequence of genomic DNA coding 

human AID protein by PCR and diagnosis for the presence or absence of mutation in the 

partial nucleotide sequence 

The AID protein of the present invention may^involve in sideration of various ^ 

immunodeficiency and allergic disease. For example, a given immunodeficiency or allergic 
30 disease may be caused by mutation or deletion in the nucleotide sequence of genomic DNA 

(especially exon) coding an AID protein. 
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The presence or absence of such mutation or deletion in genomic DNA can be 
analyzed by, for example, following examples. 

(1) A pair of primers comprising a nucleotide sequence complementary to a given 
partial nucleotide sequence of genomic DNA coding AID protein in the present invention is 
prepared. 

(2) Using genomic DNA coding AID protein obtained from tissues or cells of a 
patient suffering from immunodeficiency or allergic disease as a template, an objective 
partial nucleotide sequence of the genomic DNA is amplified with the pair of primer DNA. 

(3) By analyzing the presence or absence of a PCR product and a nucleotide 
sequence of the PCR product, and comparing the nucleotide sequence with a corresponding 
nucleotide sequence in genomic DNA coding AID protein derived from a normal person, 
mutation or deletion in the genomic DNA is identified. 

Specifically, this method enables, for example, not only elucidate relationship 
between immunodeficiency or allergic disease and AID protein, but also diagnose the 
diseases by the above method in the case that AID protein is a cause of sideration of a given 
type of disease (for example immunodeficiency or allergic disease). 

For the above purpose, the following 15 kinds of primers were designed and prepared 
based on a given partial nucleotide sequence in the genomic DNA coding human ATP 
protein. 

Primer: p3 (SEQ ID No. 20) 
Primer: p9 (SEQ ID No. 21) 
Primer: plO (SEQ ID No. 22) 
Primer: pl2 (SEQ ID No. 23) 
Primer: pl4 (SEQ ID No. 24) 
Primer: pl6 (SEQ ID No. 25) 
Primer: pi 7 (SEQ ID No. 26) 
Primer: pl9 (SEQ ID No. 27) 
Primer: p26 (SEQ ID No. 28) 
Primer: p29 (SEQ ID No. 29) 
Primer: p36 (SEQ ID No. 30) 
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Primer: p48 (SEQ ID No. 31) 
Primer: p59 (SEQ ID No. 32) 
Primer: p85 (SEQ ID No. 33) 
Primer: p86 (SEQ ID No. 34) 

By PCR using the above primers as a pair of primers by the following combinations, 
and a genomic DNA isolated from human B lymphoma cell RAMOS as a template, a partial 
nucleotide sequence coding each target human AID protein was amplified. FIG. 21 shows 
relative locations of genomic DNA partial nucleotide sequences amplified by each primer 
pair. 

(1) DNA comprising nucleotide sequence of SEQ ID NO:3 1 and DNA 
comprising nucleotide sequence of SEQ ID NO:32; 

(2) DNA comprising nucleotide sequence of SEQ ID NO:20 and DNA 
comprising nucleotide sequence of SEQ ID NO:22; 

(3) DNA comprising nucleotide sequence of SEQ ID NO:2 1 and DNA 
comprising nucleotide sequence of SEQ ID NO:30; 

(4) DNA comprising nucleotide sequence of SEQ ID NO:24 and DNA 
comprising nucleotide sequence of SEQ ID NO:25; 

(5) DNA comprising nucleotide sequence of SEQ ID NO:23 and DNA 
comprising nucleotide sequence of SEQ ID NO:27; 

(6) DNA comprising nucleotide sequence of SEQ ID NO:23 and DNA 
comprising nucleotide sequence of SEQ ID NO:28; 

(7) DNA comprising nucleotide sequence of SEQ ID NO:23 and DNA 
comprising nucleotide sequence of SEQ ID NO:29; 

(8) DNA comprising nucleotide sequence of SEQ ID NO:26 and DNA 
comprising nucleotide sequence of SEQ ID NO:27; 

(9) DNA comprising nucleotide sequence of SEQ ID NO:26 and DNA 
comprising nucleotide sequence of SEQ ID NO:28; 

(10) DNA comprising nucleotide sequence of SEQ ID NO:26 and DNA 
comprising nucleotide sequence of SEQ ID NO:29; 
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(11) DNA comprising nucleotide sequence of SEQ ED NO:34 and DNA 
comprising nucleotide sequence of SEQ ID NO:28; 

(12) DNA comprising nucleotide sequence of SEQ ID NO:34 and DNA 
comprising nucleotide sequence of SEQ ID NO:29; 

(13) DNA comprising nucleotide sequence of SEQ ID NO:33 and DNA 
comprising nucleotide sequence of SEQ ID NO:29; or, 

(14) DNA comprising nucleotide sequence of SEQ ID NO: 1 8 and DNA 
comprising nucleotide sequence of SEQ ID NO: 19; 

The condition for PCR was set by the following manner. 

<Reaction solution> 

A total amount of 20.2 jxl consisting of DDW (8 ^1), 10 X buffer (2 jitl), dNTP 
(2.5 mM each, 2 jil), 2 mM primer 1 (2 ^1), 2 jaM primer 2 (2 jlxI), genomic DNA isolated 
from human B Lymphoma cells (185 ng/^1) and Taq polymerase (5 U/ml, 0.2 jal), Ex Taq 
(TAKARA), or Ampli Taq (Perkin Elmer). 

<Reaction> 

Reaction was conducted by the following (A) or (B). 

(A) 1 cycle (reaction at 94°C for 30 sec) and 40 cycles (reaction at 94°C for 

10 sec, reaction at 54°C for 30 sec, and reaction at 72°C for 3 min and 30 sec) and stored at 
4°C 

(B) 1 cycle (reaction at 94°C for 30 sec) and 40 cycles (reaction at 94°C for 

10 sec, reaction at 55°C for 30 sec, and reaction at 72°C for 2 min and 10 sec) and stored at 
4°C. 

<PCR equipment> 

A commercial PCR device (Perkin Elmer Thermal Cycler 9700 type) was used. 

Example 19: The expression of human AID mRNA in various human organ tissues 

The expression of human AID mRNA in various human organ tissues was analyzed 
by RT-PCR according to the standard method (Immunity, 9:1-10, 1998). 
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RT-PCR was conducted by using various tissues set in the human tissue cDNA panel 
(CLONTECH) as a template according to the standard method. 

AID cDNA was amplified by primers pl7 (SEQ ID NO:26) and p26 (SEQ ID NO:28) 
prepared in the above and Taq polymerase. 

As a control, RT-PCR in the same manner was conducted using cDNA of G3PDH as 
a template and GF primer and GR primer (Immunity, 9:1-10, 1998). 

FIG. 24 shows the result. As a result, Specific expression of mRNA was confirmed 
in lymph node and tonsil. This result was consistent with the experimental result in which 
the expression of mRNA for mouse AID was observed in the various lymphatic tissues 
(Examples 8 and 9). 

On the other hand, when RT-PCR was conducted with the saturated cycle number in 
the same manner as the above in the above RT-PCR, the expression of AID mRNA was 
observed in almost all analyzed organs. 

Example 20: Localization of human AID gene on human chromosomes 

Localization of human AID gene on human chromosomes was analyzed by 

Fluorescence in situ hybridization (FISH) method (Experimental Medicine, Suppl. "Genetic 

Engineering Hand Book" published by Yodosha, 1992, p. 271-277). 

Genomic DNA including human AID gene (exon 1 to exon 5), isolated in the above, 

which was labeled with biotin-1 1-dUTP (Sigma) by the nick translation method was used as 

a probe for hybridization. 

The probe was hybridized with chromosomes in metaphase human cells. 

Hybridization signals were detected using fluorescein isothiocyanate-avidin (DCS) (Vector 

Laboratories). 

FIG. 25 shows the result. As a result, human AID gene was found to be localized on 
chromosome 12pl3. Moreover, this location was revealed to be near 12pl3.1 which is the 
location for the above APOBEC-1 comprising a relatively high amino acid sequence 
homology with the AID protein and has the same cytidine deaminase activity as that of the 
AID protein. 

It has been reported that some abnormality on human chromosome locus 12pl3.3- 
12pl 1.2, 12pl3.2-12p24.1 and 12pl3 may be involved in diseases such as acrocallosal 
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syndrome, inflammatory bowel syndrome familial periodic fever, respectively, however, 
causative gene thereof has not been traced yet. It has been suggested that human AID gene 
of the present invention may be involved in sideration of such diseases. 

All references and patents cited herein are incorporated by reference in their entirety. 

INDUSTRIAL APPLICABILITY 

The AID protein of the present invention can be considered to have a function of 
regulating various biological mechanisms required for generation of antigen-specific 
immunoglobulins (specific antibodies), which eliminate non-self antigen (foreign antigen, 
self-reacting cells, etc.) that triggers various diseases. More specifically, the AID protein of 
the present invention can be considered to be one of the enzyme that plays an important role 
in the genetic editing such as RNA editing and so on occurring in germinal center B cells, 
such as activation of B cells, class switch recombination of immunoglobulin gene, somatic 
hypermutation, and affinity maturation, which are the mechanisms for generation of 
immunoglobulin having high specificity to antigens. 

The dysfunction of the AID protein of the present invention can be the cause for the 
humoral immunodeficiency since it induces failure of germinal center B cell function, such 
as antigen-specific B cell activation, class switch recombination, and somatic mutation. 
Reversely, the breakdown of the regulation of AID protein may induce allergy disease or 
autoimmune disease since it can cause inappropriate B cell activation and needless class 
switch recombination and somatic mutation. 

Therefore, regulation of the function of AID protein and the gene encoding it enables 
preventing and treating various immunodeficiencies, autoimmune diseases, and allergies, 
which result from, for example, B cell dysfunctions (e.g. IgA deficiency, IgA nephropathy, 
y globulinemia, hyper IgM syndrome, etc.) or class switch deficiency of immunoglobulin. 
Thus, the AID protein and the gene encoding the AID protein can be targets for the 
development of drugs for therapy of diseases mentioned above. 

Examples of diseases whose onset prevention, symptom remission, therapy and/or 
symptomatic treatment effect is expected by regulating the function of the AID protein of the 
present invention or the gene encoding it include, for example, primary immunodeficiency 
syndrome with congenital disorder of immune system, mainly immunodeficiency considered 



77 



to develop by B cell deficiency, decrease, or dysfunction (e.g. sex-linked 
agammaglobulinemia, sex-linked agammaglobulinemia with growth hormone deficiency, 
immunoglobulin deficiency with high IgM level, selective IgM deficiency, selective IgE 
deficiency, immunoglobulin heavy chain gene deletion, k chain deficiency, IgA deficiency, 
5 IgG subclass selective deficiency, CVID (common variable immunodeficiency), infantile 
transient dysgammaglobulinemia, Rosen syndrome, severe combined immunodeficiency 
(sex-linked, autosomal recessive), ADA (adenosine deaminase) deficiency, PNP (purine 
nucleoside phosphorylase) deficiency, MHC class II deficiency, reticular dysplasia, Wiskott- 
Aldrich syndrome, ataxia telangiectasia, DiGeorge syndrome, chromosomal aberration, 
10 familial Ig hypermetabolism, hyper IgE syndrome, Gitlin syndrome, Nezelof syndrome, 
Good syndrome, osteodystrophy, transcobalamin syndrome, secretary bead syndrome, etc.), 
various diseases with antibody production deficiency that are secondary immunodeficiency 
g syndrome with disorder of immune system caused by an acquired etiology (for example, 

AIDS, etc.), and/or various allergic diseases (e.g., bronchial asthma, atopic dermatitis, 
01 15 conjunctivitis, allergic rhinitis, allergic enteritis, drug-induced allergy, food allergy, allergic 
f& urticaria, glomerulonephritis, etc.). These could be targets for drug development. 

03 Namely, the AID protein of the present invention, a fragment thereof, a DNA 

s encoding the AID protein, a fragment thereof, and an antibody against the AID protein are 

if useful as reagents for developing drugs for prevention and therapy of such diseases, 

fy 20 Also, the DNA itself is useful as an antisense drug regulating the function of AID 

q gene at a gene level and in a use in gene therapy. The protein or the fragments thereof (e.g. 

*** enzyme active site) itself is useful as a drug. 

Furthermore, an antibody reactive to the AID protein of the present invention or a 
fragment thereof is extremely useful as an antibody drug by regulating functions of the AID 
25 protein. 

Furthermore, the gene (DNA), protein, and antibody of the present invention are 
useful as reagents for searching substrates (e.g., RNA, etc.) interacting (binding) with the 
protein (enzyme) of the present invention, or other auxiliary proteins associated with the 
protein of the present invention, and for developing drugs targeting the substrates and 
30 auxiliary proteins. 
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Furthermore, a method for identifying a substance that regulates production of the 
AID protein of the present invention or transcription of a gene encoding the AID protein into 
mRNA are extremely useful as means to develop drugs for therapy and prevention of various 
diseases (especially, immunodeficiency and allergic disease) in which the above-mentioned 
AID protein or AID gene is considered to be involved. 
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